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Final Report — NSG-1550 
Eugene Galanter 

Psychophysics Laboratory, Columbia University 

The Psychophysics Laboratory and the ANRL of the 
NASA-Langley Research Center have been engaged in a cooperative 
research enterprise since 1975. He have both been trying to 
untangle and explicate the social-psychological, acoustic, and 
perceptual aspects of the human annoyance response to aircraft 
overflight noise. This work has included both laboratory and 
field research. The field work was designed to assess the basis 
of community response to aircraft noise. The laboratory studies 
served to develop models of such response tendencies that might 
identify aspects of the (single and multiple) events that give 
rise to the annoyance responses. 

The objective of these efforts was to find ways to control 
the salient features of the annoyance function, and so reduce 
ccxamunity annoyance responses. We expected that changing 
identified pareuaeters of a model could lead to altered response. 
Environmental variables that can be located by a theoretical 
model may offer a high benefit/cost ratio in effecting a 
reduction of annoyance. 

Our work over the past several years has concentrated on two 
aspects of the general problem: 1) How do the effects of many 

separate exposures to aircraft overflights cumulate within a 
person's judgmental frame to yield a long-term measure of 
individual annoyance. 2) How can we assign a numerical value to 
the magnitude of a person's annoyance that will capture a variety 
of behavioral consequences, such as complaint behavior, community 
action, etc. 

The first question has lead to the development of multiple 
exposure models of human annoyance that broke with mere acoustic 
additivity of one or another kind. During the past one and one 
half years, these various models have been subjected to a variety 
of experimental tests. The culmination of this work is described 
in the dissertation presented to the Graduate School of Arts and 
Sciences, Columbia University by Nancy Haber. Segments of that 
dissertation are included in this Final Report as Appendix I. 

The second question has resulted in the development of a new 
scaling technique, named Utility Comparison Scaling, that permits 
the assignment of numerical values to aversive or desirable 
events. This work has been prepared for publication, and is 
being submitted in revised form and with supplementary data to 
the journal Public Opinion Quarterly. We expect an early 
editorial decision on this revision of PPL Technical Report 
NASA- 5, dated. 

Finally, the general problem of comparative judgment scale 



2 


construction, of which the Utility CtMiparison Scale is a special 
case, has foraed the basis for a dissertation for the Ph.O. 
degree deposited in June 1983, by Richard Popper with the 
Graduate School of Arts and Sciences. Appropriate sections of 
that dissertation are included as Appendix II. 

On looking back at the variety of activities that this 
research support has generated I can only conclude that this 
contribution by NASA has aided the ongoing activities of the 
Laboratory as well as the educational mission of the University. 
The total number of students, both undergraduate and graduate 
that have had their research skills sharpened under the aegis of 
this grant is 19 undergraduate research assistants, and 5 
graduate students of which three have received Ph.O. degrees and 
one has received an M.A. 

During the course of the work, several collateral activities 
were undertaken that were of direct aid to the major thrust of 
this research program. One ccxnponent was a survey designed and 
analyzed for the FAA to assess the impact of the inauguration of 
the Concorde supersonic transport. A second collateral study was 
conducted for the NASA- Langley Research Center to permit the 
cOTiparison under laboratory conditions of canmunity residents 
from the New York City area with residents from Hampton, Va. and 
near-by communities. A final study conducted for the FAA 
provided laboratory data for the analysis of helicopter noise, 
and noise from fixed-wing turbine aircraft in both single and 
multiple overflight conditions. 

In summary, I believe that empirical knowledge about the 
impact of environmental events was accumulated, and used to 
develop quantitative models of human behavior in the face of such 
events. The techniques that evolved have enhanced the 
capabilities of researchers in the social and behavioral 
sciences. We have shown that survey research methods may be 
augmented by psychophysical scaling techniques to the benefit of 
both disciplines. Although we are still short of the goal of 
calculating the direct and indirect effects of aircraft 
overflight noise on comnunity residents, we have improved by a 
considerable degree our understanding of the nature of these 
community judgments, and how they can be measured and altered. 
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ColuRi>ia University 
Psydiqphysics Laboratory 
Mew York, Mew York 10027 


TSeporal Characteristics of Bnotional Response: 

A Decay Model ot Annoyanoe Response to Acoustic Moise 


1.0 General introduction 


Blatz demonstrated in 1925 that physiological re^ionses to a fear- 
provoking stimulus increase sharply at the onset of the stimulus and 
eventually return to their respective resting states. If E^siological 
re^Kxises to enotion-provoking stimuli decay after the onset of the stimuli, 
then it is hypothesized that so viill the emotional state (subject, of 
course, to cognitive and other contextual factors. Buck, 1976; Sdhachter & 
Singer, 1962). A series of experimmits are prqposed to demonstrate the 
existence of this change in emotional state as well as to estimate tenporal 
parameters for specific emotion-provdcing situations. 

Stimuli, such as electric shodc (Mlsbett & Sdiachter, 1966), 
photographs of emotionally-loaded stimuli (Frijda & Van der Geer, 1961) and 
manipulated situations (Elcinan, 1965) , have been widely used to evoke 
emotional responses. However, with the exertion of electric shodc, these 
stimuli provide no natural units and therefore do not avail themselves to 
systematic manipulation of the stimulus variable along the intensive 
dimension. Even though electric shock has natural units (voltage), its use 
has he&n limited due to health and safety concerns. Acoustic noise, on the 
other hand, has very ^lecific descriptive units, as well as a consistent and 
well-established history of evoking an affective response called 
**annpyance.” Kryter (1970) defined annoyance as "the subjective impression 
of the unwantedness of a not unexpeerted, nonpain or fear provdeing sound as 
part of one's environment." 

The method most coninonly used in assessing the degree of annoyance 
produced by noise is the method of self-r^>ort. Since measures of 
self-report are more likely to represent bdiavior than physiological 
measures, in that both behavioral re^nses and measures of self-report are 
modified by display rules, its use is suggested by many researcdiers 
(Hilgard, 1969; Mackay, 1980). 
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The existMoe of nuneroua mwurwents of noise, notably aircraft 
noise, indicates the lack of a reliable, predictive measure of annoyance* 
Noise tMasursmsnts caimonly used in the tkiited States and Great Britain 
represent descriptive models of annoyance, the parameters of these 
measurements^ nunter, duration mid/or amplitude (either peak or average) — 
correlate well witii judged aimoyanoe (Rice, 1977a) . Ibis ««ould seem to 
indicate that the parameters of the noise measurements reflect the major 
variables of the noise that are producing tdie annoyance levels. However 
since most noise measurements have been desicped in an effort to predict 
oGmnunity re^Kxise to noise rather than to model tiie individual annoyance 
re^Kxise function, certain, perhaps more subtle, parameters have been 
over Indeed. 

In essence, the present models of annoyance production merely correlate 
die parameters of duration, number and amplitude of the noise events idth 
the community annoyance re^ionse, umoally to aircraft noise. The dynamic 
environaent, diat is how eaob event in time contributes to the indivichial's 
annoyance leiwl, is not considered in these measurements. Althou^ the 
correlation between the amount of annoyance produced by noise ei^iosure and 
these measures is good for community responses, the models idiicb underlie 
the measures for individual judgments are ladcing a temporal parameter. The 
importance of including not only amplitude, raiober and duration parameters 
in the annoyance response function, but also a Umporal parameter, will be 
demonstrated in eiqieriments that examine the annoyance response function 
produced by acoustic noise. 

In this section, four models of annoyance will be described and tests 
of consistency derived. The four models are a peak energy model, an energy 
averaging model, an energy sunination model and an annoyance decay model. 

1.1 "Peak** energy model 

A simple model of the annoyance rm^ionse function is one in vdiidi the 
responae is represented by the highest miergy level present in a stimulus or 
set of stimuli. An annoyance re^x)nse is gmerated solely on the peak 
energy level presented ^ring a time interval. 

Most si;qpport for the peak energy model is found in studies on the 
effect of aircraft noise on conmunity residents. Many itiVMtigators have 
found that the roaxinun A-wei^ted sound level (sound weighted to 
quantitatively reduce the effect of low frequmxy noise, hence approximating 
the re^xxise of the human ear to sound) occurring during a single aircraft 
overflight correlates with the noisiness produced by the individual aircraft 
flyover (Berglund, Berglund & Lindvall, 1975; Young & Peterson, 1969). 
Inlander proposed that the peak dB(A) — maximum A<weighted sound level — of 
the noisiest type of aircraft that occurs more than three times a day on an 
airfield that hns more than 63 take-offs in 24 hours provides an adequate 
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measure of the ccnnunity noise limit (Inlander & Sorenson, 1973; Inlander, 
Sorenson, Alexandre & Gilbert, 1973; ^lander, Sorenson & Berglund, 1974; 
Inlander, Sorenson & Kajland, 1972; Sorenson, Berglund & Inlander, 1973). 
Inlander et. al. found a linear relation^ip betueen the percental of 
people »4x> were 'highly annoyed" and the peeric dB(A) of the area. This model 
does not a^ly to areas with less than 63 take~offs in a 24 hour period, nor 
does it provide an ei^lanation as to how the diaracteristics of as few as 3 
flyovers can influence the reported annoyance of connunity residents unless, 
of course, the peak dB(A) of the ranaining 60 or so overflights correlates 
«rith the 3 reference flyovers. It follows from this model that, provided 
two ooranunities have the same peak dB(A) value, the characteristics of the 
remaining 60 or so overfli^ts have no effect on the judged annoyance levels 
of the connunity monbers. Other investigators have found this not to be the 
case (Cotuior & Patterson, 1976; Kryter & Pearsons, 1963; Powell, 1980). 

Their results si^iport an energy "averaging" model. 

1.2 Energy "averaging" model 

An energy "averaging" model whidi has been shown to predict connunity 
annoyance jud^nents to aircraft noise quite well is the "equivaloit sound 
level" or Leq model (Eldred, 1975; Powell, 1980; Rice, 1977b; Shepherd, 

1981) . This is the average A-weighted noise level on an energy basis for 
some specified amount of tira^. This method equates the nixnerical value of a 
fluctuating sound to a steady state sound with the same amount of total 
energy that continues for some specified period of time (Pearsons & Bennett, 
1974) . 


Since the Leg model does not take into account time of presentation, 
there is no clear definition as to what constitutes the start of a time 
period, i.e. whether the time starts at some arbitrary point or whether the 
time starts whc»i the first noise occurs. This model is not concerned with 
temporal characteristics and therefore, the time period in t^ich miergy is 
intergrated is usually defined by the researcher as either an arbitrary hour 
or day in comunity studies or as the session length in laboratory researdi. 
Both of these measures are ind^>endent of the time of presentation of noise 
evmits. 

An energy averaging model predicts that: (1) increasing the maximun 
intensity of a constant nuidber of aircraft overflights while spreading out 
their occurrence in time results in constant annoyance; (2) annoyance is 
constant regardless of the number of aircraft overflights as long as the 
average energy level over time is constant (two or more successive 
overflights with equivalent peak dB(A) should result in constant annoyance 
equal to that of a single overflight as long as the intergration time for 
the two overflights is twice that of the single overflight) ; and (3) 
annoyance should not increase as the duration of a signal increases because 
the average energy level is unchanged. 
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Althou^ Sh^iherd (1981) found Lag to be as good as or better thvi 
other noise measures in correlating the pt^ical characteristics of aircraft 
noise with amoyanoe levels it has been shown that increasing the maxinum 
intensity of four overflights while feeding out their occurrence in time 
(so that Leg remains constant) does not result in constant annoyance, 
increasing the number of overfli^ts while leading tiw peak dB(A) level 
constant does not result in constant annoyance ((Salanter & Popper, Note 1) , 
and increasing the duration of a signal does not result in constant 
annoyance as the Leg model would predict (Kryter & Pearsons, 1963; Molino, 
1979) . Many investigators have found an energy "sunmation" model to predict 
annoyance responses better than either the peak dB(A) or the Leg model. 

1.3 Energy "sanaation** model 

Connor and Patterson (1976) have shown that annoyance increases 
steadily with increases in the total energy level for constant daily 
aircraft operations and with numbers of operations. Ibis result implies 
that eunnoyance judgments are d^>endent not only on the energy level of the 
single overfli^ts, but also on the nucober and duration of events. This 
result introduces the importance of number on the calculation of annoyance 
judgments. 

A measure of this type is Ls. Ls is the energy sunmation of some noise 
level (A>weighted) for some specified period of time. Annoyance accumulates 
continuously according to Ls. Several investigators hiave found that the Ls 
model predicts annoyance responses in laboratory settings better than the 
Leg model (Balin, Haber & Popper, Note 2; Galanter & Popper, Note 1; Haber & 
Karsten, Note 3; Perera, Popper & Galanter, Note 4). 

It appears that for groups of aircraft overflights occurring over 
relatively short time intervals, an energy summation model predicts 
annoyance le^l judgments better than the peak dB(A) or Leg models. 

However, none of these preposed models account for temporal variations 
within a fixed time period. Researchers have found that signals 
characterized as approadiing, i.e. peak amplitude near the end of the 
signal, are judged more annoying than receding signals assuming constant 
average and peak amplitude over the signal duration (Haber, Note 5; Nixon, 
von Gierke & Rosinger, 1969). Peak dB(A), Leg and Ls would predict no 
difference in annoyance judgments between approaching and receding signals 
because the overall and peak energy levels are the same for the duration of 
the signals. 

Molino (1979) reviewed experiments in whi^ only rise times were 
varied. Noises having rise times greater than three seconds were judged to 
be more annoying than those that readied their maximum level in about one 
second, even though the total duration and total energy were the same. Ibis 
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CMult may indlicata that vAmh more auditory energy occurs at the end of a 
time period (long rise time) , the arm^anoe response is greater than idien it 
occurs at the beginning (^rt rise time) provided the overall energy is the 
sane. The possibility exists that annoyance level may decrease over a 
period of time, so even though noises %dth hi^^r peak levels are more 
annoying than noises with lower peak levels at the time of noise 
presentation, the annoyance level as time between noise events 

increases and time after noise events progresses. 

1.4 Annoyance "decay" model 

A decay model of annoyance level assianes that annoyance decreases as 
time passes. A burst of noise results in extrme annoyance imnediately 
following the evmt, but hours wid certainly days later the annoyance level 
is greatly reduced. 

The decay model consists of an emotion-provoking event and a req»nse 
function. Suppose the euinoyance response system is viewed as a general 
linear filter. A general linear filter consists of a set of weights (g^, 

•••» ?s)« that, if the input to the filter is some function (y^), 
the output IS 


*t 


s 

■^TguYt-u* 

u»r 


If the input is an impulse (a single, non-zero value), the ou^njt consists 
of the weights (Bloomfield, 1976) . The weights are also known as the 
impulse response function. To account for a decrease in annoyance level 
with the passage of time, as the decay model would predict, tte impulse 
response function ^ould be a decreasing function. 

Since other researchers have failed to confirm a decay model through 
either a post hoc (Sh^;^rd, 1981) or an a priori design (Powell, Note 6), 
an ei^lanation of their results is warranted. ^Iht^iherd dMigned his 
ei^riment to examine **cumulative annoyance due to multiple aircraft 
flyovers with differing peak noise levels.” He did not set out to examine 
an annoyance decay model, but considered the annryance decay model by 
calculating new Leg values for test sessions by using a range of decay rates 
of 0 to 20 dB per hour for eadi session. The correlation coefficients 
between mean annoyance judgnents and these new Log values did not improve 
over tne correlation between mean annoyance and the unadjusted Leg value. 
Therefore he concluded that the data did not support the decay hypothesis. 

Powell, on the other hand, set out to study the effect of 
time-of-occurrence on annoyance judgments. His experiment consisted of 1, 2 
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or 4 hi^noiM l«v«l flyovors occurring at tha beginning# middle or end cf 
a 30<fliinute teet session containing a total of ei^t flyovers. The peak 
dB(A) of the hi^ and lov-noise level flyovers was# rei^iectively# 79.2 dB 
and 67.2 dB. The results of an analysis of variance witii annoyance 
re^)onse8 as the d^>«idmt measure indicate that dw time-of-occurrence of 
the high-noise level flyovers was not a sig^^Ificant factor. Although an 
increase in annoyance is r^rtsd when the hig^i-noise level flyovers occur 
at the end of the session# there is no consistent trend as time between the 
occurrence of the high-noise level flyovers and the end of the session 
decreases. Because the eiqperimental design is inconplete# Powell could not 
test for interaction between time-of -occurrence and number of high-noise 
level flyov^s. Therefore he concludes "the ei^eriment does not provide 
conclusive information to justify ccnpletely discounting any 
time-of-occurrence effect (Powell# Note 6# p. 7)." 

Powell's second ei^riment tested the effect of session lengtii and 
nuRber of flyover noises on annoyance judgments. The durations of the 
sessions that he examined were 15# 30 and 60 minutes and the sessions 
contained 1# ^ or 16 flyovers evenly distributed in time with a fixed peak 
noise level of either 79.3 dB or 61.3 dB. It should be noted that «ben one 
flyover occurred# Powell placed the occurrence of that one flyover directly 
in the middle of the session. His results indicate that annoyance decreases 
as session Imgth increases. This result supports an Leg (energy averaging) 
model and does not support an : » (energy summation) model. As in the first 
experiment because of his experimmtal design# Powell was unable to examine 
the interaction between mmber of flyovers and session duration. Therefore# 
he is unable to conclude as he was unable to conclude in his first 
ekpsrimmt that the annoyance decay model cannot be sv^iported. In fact his 
results in this periment do support an annoyance decay model. As session 
length increases# the time between overfli^ts increases. Therefore# the 
time in which the anne^noe to each overflight decays increases. If the 
annoyance to each overflight decays as time passes# the annoyance judgments 
made at the conclusion of th* test session should be nudh less at the end of 
a 60-minute session than they would be at the end cf a 15-minute session. 
Injects in these experiments# as is the case in most experiments which test 
models of emotional response# are asiced to attmd to the noise eitiier 
exclusively or while engaged in passive activities. Subjects are usually 
asked to "relax and read or do any needlework [they] may have brought with 
[them]” (Powell# 1980# Note 6; Shepherd# 1981). It is fypothesized that 
this lack of activity does not encourage "reported" annoyance to decay. The 
request of subjects to attend to noise exclusively inplies a sort of 
accounting procedure where subjects hear a noise# evaluate the noise and 
store this information for future reference. As we know in real-life as 
noise occurs around us# we are not usually attending exclusively to the 
noise. Thus the noise occurs and the event passes without any conscious 
analysis of the event. Therefore# engaging subjects in a task, the 
occurrence of noise and the evaluation of the annoyance produced would 
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sinulate the disturbance of noise in a natural setting. 

In his researchr Bor^cy engaged subjects in a task of iiatching a color 
television program so as to recreate the disturbance of aircraft flyovers in 
a nacural setting (Borsky, 1977; Bor^cy & Leonard, 1973). So as not to 
confuse annoyance %dth auditory masking effects, subjects diottlc participate 
in a non-auditory ta^. Reviews on the effects of noise on performance 
conclude that the majority of publidied studies indicate that noise is 
likely to have no effect on performance for vigilance or conpensatory 
tradcing tasks (Coates, AiScins & Alluisi, 1973; Coates, Alluisi & Adk'ns, 
1977; Lod>, 1975). Therefore, it is suggested in this researdi that 
subjects be adced to judge their degree of annoyance to noise events that 
occur while are engaged in a co m pe n satory tradcing task. 

This research is conducted to test the models of annoyance re^xxise to 
noise and estimate pauraneters for the re^poi:st* function for individual noise 
events. In the first eqieriment, subjects are asked to make annoyancs 
judgments at the end of time periods of various durations in whidi bursts of 
white noise are presented. According to the decay model, the placement of 
these noises in time affects the annoyance jud^nents of individual subjects. 
For example, noise events presented at the beginning of a time period are 
perceived as less annoying than noise events presented at the end of a time 
period provided that the judgment is asked for at the end of the time 
period, and the total energy levels during the time periods are the same. 

The pe^ ®(A), Leg and Ls models %#ould not support the temporal effect as 
long as the pe^, average and total energy levels were tiie same. The final 
eiperiments are designed to test the form of the deca;^ function for single 
bursts of noise and to compare the decay constants for bursts of va 'ious 
amplitudes. 

The four factors examined here are the session length, the mxnber of 
noise bursts, the amplitude of the bursts and the time of presentation of 
these noise bursts. Because these four facrtors are relevant to the four 
models presented in the introduction, inclusicxi of these factors 
simultaneously in one experimental design provides a test of these models. 

This experiment is designed in a way so that the size of the block 
(nxiber of sessions/subject) can be reduced althou^ that means sacnrificinr; 
information on certain high-order interactions. By partially confounding 
rather than completely confounding high-order interactions, information is 
available on che confounded interactions. Since these ef^x;ts are 
confounded, special calculations are required (for a detailed discussion of 
the experimental design and data analysis, see ^paidix A). 

An analysis of variance can determine Whi(^ model best represents the 
anncyance response function. The peak dB(A) model would be supported if a 
significant difference existed for amplitude only. Other factors such as 
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session length, number of events and time of presentation have no bearing on 
response level according to the peak dB(A) model. The energy averaging 
(Leq) model would be supported if session length, number of events and 
anplitude were significant factors, but not time of presentation. The 
ener^ summation (Ls) model relies only on nmnber and amplitude. However, a 
significant difference on time of pres^tation would support the hypothesis 
that a temporal parameter increases the predictability of reqpcnse measures 
of annoyance in a dynamic environnent, that is the annoyance decay model. 

To verify that indeed no noise effects on task performance exist, some 
subjects will not experience any noise interference while engaged in the 
compensatory tradcing task. Performance is compared between noise and 
no-noise experimental conditions to determine what, if any, noise effects on 
task performance exist. In order to test the hypothesis that exclusive 
attention to noise encourages subjects to "hold on" to annoyance levels in 
anticipation to re^xxme time, some subjects are asked to make annoyance 
judgments %<hile sitting quietly in a room reading magcizines. Comparisons of 
annoyance re^xxise functions will test this hypothesis. 

2.0 Method - Experiment 1 

2.1 Subjects 

The eiqierifflental design for Ejq)eriment 1 requires 36 & (see Appendix A 
for details) . The sessions for two Ss were run again using two other Ss 
be c ause the original two Ss used "percentage annoyed during session" 
re^xxmes instead of magnitude estimation re^xxises as the instructions 
stated (their re^xxises were given in terms of percentiles) . Also, one S 
had difficulty with the compeisatory tradcing task, so his sessions were 
also r^laced by another S. Most of the Ss, 26, were male. All Ss were 
between the ages of 17 and 35. A subset of the 36 & (Replication 2: nine 
see Appendix A, Table A2) were called bade to nin in a "no noise" 
condition and an adoitional nine Ss ran in a single r^lication (Replication 
2, see j^jpendix A, Tedale A2) of the experimental design in a "no task” 
condition. Ss were paid $4.00 for participation. All & reported normal 
hearing. 

2.2 Stimuli 


Ss were presented with six sessions (3, 6 or 9 minutes in length) in 
which ensendsles consisting of 1, 2 or 4 bursts of white noise with peak 
amplitude of 74, 80 or 86 dB(A) were presented either early (beginning 30 
seconds after the start of the game session) or late (ending 10 seconds 
before response time) . Bursts of noise remained at their peeik amplitude for 
10 seconds (250 msec, rise and decay times) . Bursts within an assemble were 
s^mrated by a 10-second interstimulus interval measured from the end of one 
ramp to the start of the next. Slow rise times were used in this experiment 
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so as to redooe the "startle” effec± for late presentations. Since the time 
from the start of the session varied for late sessions making the time of 
occurrence dependent on session lengthy this precaution equated the late 
presentations by eliminating the addition of startle to the annoyance 
judgnent. The noise bursts in the early sessions started 30 seconds after 
the start of the session as opposed to being presented at the onset of the 
session. This was done to insure that & were eictively engaged in the task 
for presmtations of noise in both early and late sessions. 

2.3 Apparatus 

A block diagram of the audio-control system is presented in Figure 1. 
White noise was generated by a General Radio 1382 random noise generator and 
fed to an audio switcn. An Automated Data Systems (ADS) 1800E process 
control system gated the noise and controlled the aoplitude of the signal by 
the use of a prograDomable attenuator. The ^tem* equipped trith an internal 
clodc, also timed the start of preseitations of ensembles and the 
interstimulus interval times. The output was ^lit and fed into a Crown IC 
150A preanplifier with calibrated step-attenuator intensity control. The 
output was th*n fed into a Crown VFX-2A crossover filter set at lowpass 20 
Hz. All frequaicies greater than 20 Hz were passed to the loud^teakers. 
Frequencies less than 20 Hz were passed to a dead-end output. The signal 
was then fed into an Altec Leuising 729A stereophonic equalizer which t«as 
previously calibrated to correct for room acoustics. The equalizer was 
cedibrated by feeding pink noise into the room (70 dB(A)) auid adjusting the 
specific frequencies to result in a "flat" octave band analysis, i.e. equal 
energy cKnross all bands. The signal was fed into a Burwen CNF 1201 noise 
suppressor whidi filters out tape noise. Since the signals were generated 
at the time of presentation and not recorded on tape, the Burwen was 
switched "exit" of the system. The signal was then ^lit: one-half fed into 
a Crown D 75 amplifier which drove two Altec Lansing Voic» of the Theater 
speaker systems (22 feet in front of S) , the other half was fed into a Crown 
D 150A amplifier which drove two Acxxistic Research 3-A Icxdspeaker systems 
(8 feet to the right and left of S) . 

The compensatory tracking task and the instructions were g&ierated on a 
Conmodore PET 2001 Series computer which included a Commodore CBM Model 2040 
dual floppy disk drive. The task was a professionally-supplied game program 
packaged under the name "Demon game" (for a copy of the computer presgram, 
see Jeffries & Fisher, 1981) . The progren was modified to allow S to 
control the game from a peripheral devico. The video output from the 
computer was viewed by £ on a cathode-ray tube (CRT) placed directly in 
front of him. An Atari jc^stick control allowed S to play the game as well 
as advanco the instructions. Communication between S and E during the game 
sessions was permitted via an audio/visual intercom system. The experiment 
took place in a 16' X 26' rexxn furnished in the style of a typical 
middle-class living room (Figure 2) . 
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2.4 Procedure 

S was greeted as he entered the laboratory and was led into the 
simulated living rocnir seated on the couch and asked to sign the 
Psychophysics Laboratory Subject Informed Consmt Form (i^ipendix B) . E then 
read the instructions (Appendix C) while S followed along on the videoT 
monitor (CRT) . Ss were told that the e^riment was designed to assess the 
reaction of peoole to various noise environments — how annexing noise is 
perceiv^ to be and how noise affects work perfonnance. An exercise in the 
use of the joystick was followed by an abbreviated version of the Demon 
game. Ihe abbreviated game ended tih^ either five demons were caqE>tured or 
when three minutes expired, whic^mver came first (always the former) . Ss 
were next informed that the person who captured the most "demons” during the 
six sessions received a borus of $25.00. All & wrote their names, 
addresses and telephone numbers on a sheet of pe^r. This was done in the 
belief that receiving this information would reassure ^ of ^ intent to pay 
the prize. In fact, E did pay the person \i*JO captured”the most demons at 
the conclusion of thi? series of experiments. Next, the magnitude 
estimation procedure by vi^ich S used to coranunicate his degree of annoyance 
was explained (Stevens & Galantier, 1957} . S was asked to "rate the 
annq/ance of the ... noises in comparison to the anne^anoe produced by the 
standard [noise] ... [and to] be sure that the nuobers represent the 
annc^ance produced during eadi session in proportiwi to the annoyance 
produced by the standard ..." E then left the room and presented S with the 
standard stimulus (one lO-seconU burst of white noise at 80 dB(A)) foll(x^ 
by six gane sessions. S controlled the start of a game session by pressing 
a button on the Atari joystick. E detennined v*i«j the session was over 
(either 3, 6 or 9 minutes) and caused the gane to terminate. The next 
session started with a new game when S pressed the button on the Atari 
joystick. At the end of each game session, S was asked to verbally give his 
annoyance response to E. At the end of the six game sessions, a short 
questionnaire ()^?pendi}( D) was presented and completed by S. An 
experimmital session lasted one hour. £ was paid a particTpation fee of 
$4.00, ddsriefed and dismissed. 

The procedure for the no-noise condition was identical to the procedure 
for the previous experiment exc^t that instructions regarding magnitude 
estimation were omitted and no judgments on noise were made. Ihe standard 
noise burst as well as noise during sessions were omitted. The 
questionnaire contained only items that did not refer to noise. 

The proesdure for the no-task condition was the same as the procedure 
for the general experiment. E read the instructions While S followed along 
on the CRT. No mention was made of the task. The magnitude estimation 
procedure was explained. S was told that he could read maigazines that werv.- 
left for him if he wished (current issues of Newswe^ , People and U.S. News 
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& World Rgx)rt) . Noise was then presented as in the task condition. Items 
Tn the ju^tionnaire referring to the game were omitted. 

3.0 Results and Discussion - E:g)eriinent 1 

The main effects in this experiment %#ere of primary interest. A 
significant effect solely on amplitude would sc^iport the peak dB(A) model; 
significance on number and amplitude would sv^iport the Ls model; and 
significance on session length, raxnber and amplitude would support the Leg 
model. However, only the decay model would predict a significant difference 
on time of presentation. 

A logeurittxnic transformation was performed on the ratw data responses 
because error variances around large nixnerical re^x>nses are greater than 
those around small numerical responses. A logarithmic transformation 
results in more equivalent error variances. 

The geometric mean responses for all main effects and interactions are 
presented in Table 1. The means of confocmded interactions are pooled 
across blocks in this table. The results of an analysis of variance (Table 
2) support the hypothesis that number, amplitude and time of presentation 
are important parameters in the determination of annoyance level (p<.01) . 
Peak dB(A) is the most inportant factor in determining annoyance level 
(Figure 3). The straight lines in the figures are the result of a 
regression analysis using the geonetric means in Table 1 as the dependent 
variable. In addition, the number of bursts (Figure 4) and the time of 
presentation are also important parameters, but to a lesser degree. Bursts 
of noise presented early in a session were r^X)rted to be mudi less annoying 
than bursts presented late in a session (p<.01) when subjects were asked to 
re^ond at the end of a session. This result supports a decay model of 
annoyance level. 

If an annoyance level decay process was to be supported, one would 
expect that the interact i(m between session length and time of presentation 
would be a significant one. That is, the longer the session length, the 
smaller the response should be for early signals (the more time for the 
ann<^ance level to decay), whereas session length should have no effect on 
late signals. The results indicate that annoyance responses to bursts of 
noise presented late in a session remain fairly constant as session l«igth 
increases (Figure 5, solid line); whereas the annoyance to bursts presented 
early decay as the length of the session increases (dotted line) . In fact, 
the decay for annoyance responses to early presentations accounts for the 
session length by amplitude interaction. However, as can be seen in Figure 
6 (interaction between session length, anplitude and time of presentation), 
annoyance responses for bursts with anplitudes of 86 dB(A) and 80 dB(A) 
decay with time, whereas annoyance responses for bursts with amplitixies of 
74 dB(A) do not appear to decay. 
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TABLE 1 

GEOMETRIC MEANS OP AIM)YANCE RESPONSES F(» MAIN 
HETBCTS AND INTERACTIONS FOR EXPERIMENT 1 


Session Length (A) Number (B) 

Amplitude (C) 

Presentation 

Time 

(D) 

3 minutes: 98 

1: 

84 

74 dB(A): 57 


Early: 

84 


6 

: 85 

2: 

83 

80 : 86 


Lats 

• 

• 

101 


9 

: 92 

4: 

110 

86 : 157 








A X B* 



A X C* 




A 

X D 



3 6 9 



3 6 9 




3 

6 

9 

1: 

102 68 86 


74: 

55 5l 67 



E: 

98 

80 

75 

2: 

90 84 76 


80: 

92 80 87 



L: 

98 

91 

115 

4: 

102 107 122 


86: 186 150 138 








B X C* 



B X D 




C X D 



12 4 



12 4 




74 

80 

86 

74: 

5l 47 7l 


E: 77 77 117 



E: 

77 

77 

ITT 

80: 

87 80 92 


L; 92 100 111 



L: 

60 

91 

187 

86: 

126 154 198 











A X B X 

D* 




A 

X C 

XD* 




3 6 


9 

11 

1 



6 




E ” L E “ L 

E 

L 

E 

■ L 


E 

L 

E 

“ L 

1: 

99 105 65 7l 

7l 

104 

74: 5l 

59 


57 

5T 

63 

7l 

2: 

97 84 66 107 

53 

109 

80: 102 

83 


80 

81 

67 

112 

4: 

99 106 120 96 

112 

134 

86: 181 190 

124 

181 

100 

191 





B X C X D* 










1 

2 

4 








E 

L 

E L E 


L 






74: 

50 

60 

45 50 75 

72 






80: 

87 

86 

63 102 99 

86 






86: 

103 

153 

122 194 178 

221 






* means pooled with block effect 
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TABLE 2 

ANALYSIS OF VARIANCE OBTAINED USING A LDGARITWIIC TRANSFORMATION 
Oe AtmOYANCE RESPONSES FCXt EXPERIMENT 1 


SOURCE CP VARIATION 

SS 

DF 

MS 

P 

Between Blocks 

5.5088 

li 



R^licati<xis 

0.2213 

3 

0.0738 

0.44 

Blocks w/in R^>s 

5.2875 

32 

0.1652 


Within Blodcs 

19.3510 

180 



A (Session Length) 

0.1373 

2 

0.0687 

2.54 

B(Nuntoer)** 

0.6824 

2 

0.3412 

12.65** 

C (Amplitude) ** 

6.9030 

2 

3.4515 

127.99** 

D(Tiine) ** 

0.3377 

1 

0.3377 

12.52** 

A X B 

0.1860 

4 

0.0465 

1.72 

A X C** 

0.3845 

4 

0.0961 

3.56** 

A X D** 

0.3329 

2 

0.1665 

6.17** 

B X C 

0.0498 

4 

0.0125 

0.46 

B X D* 

0.1994 

2 

0.0997 

3.69* 

C X D 

0.1622 

2 

0.0811 

3.00 

A X B X D** 

2.4097 

4 

0.6024 

22.33** 

A X C X D** 

1.7723 

4 

0.4431 

16.43** 

B X C X D** 

1.9008 

4 

0.4752 

17.62** 

A X B X C X D 

0.2525 

8 

0.0316 

1.17 

Error 

3.6405 

135 

0.0270 



** 


indicates significance at 0.01 level 
indicates significance at 0.05 level 
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The significant interactions in Table 2 that pertain to number of 
bursts reflect the similiarity between one and two bursts and their contrast 
with four bursts. Hie effect of decay on nixrber of noise events is most 
easily seen in the interaction between session length, number and time of 
presentation (Figure 7). The slopes of the best-fitting lines for late 
bursts as a function of session length (solid lines) are approximately zero 
for one, two aix3 four bursts. Annoyance responses decay for one and two 
bursts presented early as was predicted by the decay mo^l, but the 
annoyance produced by four bursts does not decay at all (dotted lines) . Hie 
interaction between number and time and the interaction between number, 
amplitude and time support this result as well. It appears that either four 
bursts leave a very strong impression and do not decay at all or four bursts 
decay less quickly than one or two noise bursts. H)e case may be that the 
time lietween the end of the noise ensemble and the response time in the nine 
minute sessions (approximately seven minutes) is still not sufficiently long 
to result in a decay of r^xirted annoyance, just as there is no significant 
difference between early and late presentations in the three minute 
sessions. 

In order to determine whether the presence of a task had any effect on 
annoyance response and whether the task prevented the subject from 
r^iearsing his response and ther^ basing his judgm«it on his emotional 
re^xinse at response time, an additional group of nine subjects reui in a 
single replication of the experimental design (Replication 2) in a no-task 
ctmdition. One of the questions in the questionnaire answered by all 
subjects asked them to recall the nixiiber of noise events in each session. A 
t-test was performed on the absolute error totals for subjects in the task 
and no-task conditions and was not found to be significant. Although the 
memory for the events in both conditions was not significantly different, 
the annoyance responses as a function of delay to response time are 
different. As was hypothesized, the annoyance response function does not 
decay for bursts of noise presented in the no-task condition. Figures 8 and 
9 represent the interactions between session length and time of presentation 
for the task and no-task conditions, respectively. Figure 8 is based on 
nine of the subjects who ran in R^lication 2 in the task condition (the 
apprc^riate subset of the 36 subjects who ran in the full Experiment 1 
task-noise condition). As can be seen in Figure 8, the annoyance responses 
for bursts in the task condition presented early decay as session length 
increases (dotted line) . The annoyance responses for late bursts remain 
relatively constant. On the other hand for subjects who were not involved 
in a task, as Figure 9 demonstrates, annoyance responses do not decay. The 
slopes in the task and no task condition for the early presentations are 
significantly different from each other (p<.05). In conclusion, the reason 
why researchers have not found decay effects in the past may have been that 
they were not optimizing the decay process by introducing the presence of a 
task. Hiis means that any value of the decay paramet .>r that one calculates 
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is oxitext sfjecific. 

As was discussed in the introducticm, since the Leq model does not take 
into account time of pres&itation/ there is no clear definition as to »rtiat 
constitutes the start of a session, i.e. whether the time starts when the 
trial begins regardless of when the first noise occurs or whether the time 
starts when the first noise occurs. If the latter is the case, then one 
could argue that the results of the analysis of variance su^iport this 
version of the Leq model because sessions in which the noise burst occurred 
late in the session would have a higher Leg value than those in which the 
noise burst occurred early. If Leg were calculated using the start of the 
first burst in this experiment as the start of the session, then this 
version of Leg would predict that annoyance re^nses to the late bursts 
would be much more annoying than responses made for the early bursts even in 
the three minute sessions. Using the three mirnite session as an exanple, in 
the early sessions energy would be averaged over three minutes, whereas in 
the late sessions energy would be averaged over anyvhere from 20 to 80 
seconds. Iherefore, the Leq value for the early sessions would be much 
lower than the Leq value for the late sessions (for single bursts there is a 
nine dB difference) . The results of this experiment do not stq^port this 
definition of Leq. Ihis can be seen in the A X B X D interaction (see Table 
1) . The geometric mean response for the single early burst presented during 
the three-minute session is 99, %^reas the geometric mean response for the 
late single burst is 105. Since the Leq values for the early and late 
session differ by 9 dB, one would expect a doubling of annoyance between the 
early and late sessions. This is not the case in this experimoit. 

In order to determine whether the interest level or past experience 
with video games haKi any effect on annoyanc:e judgnnmts, correlaticxrs between 
the geometric mean of annoyance response for each subject and the amount of 
time subjects reported they sp«it playing video games in the past month and 
the interest level of the Demon game were c:alculated. The results indicate 
no significant correlation between amount of time spent playing video games 
in the past mcxith and annoyance responses nor between the interest level of 
the Demon game and the auvioyancre responses. These results would indicate 
that the range of interest levels in the game had no significant effect on 
annoyance jud^nents. However, there was a significant negative correlation 
between the amount of time spent playing video gentes in the past month eux3 
the interest level of the Demon game (p<.01) . This result is not 
surprising. Those subjects who played video games reported that they played 
rather sophisticated ones and found this game to be of less interest to 
them. However, those that never played found the Demon genie to be more 
interesting. The reason for choosing the Demon game over other games that 
were available was that familiarity with video genes seemed to have little 
effect on the ability to play this game. The results of a correlation 
between the number of demons caught and the amount of time subjects r^»rted 
they spent playing video games is significant at the .01 level. Though 
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this result iinplies a strong correlation between muter of demons caught and 
time q)ent playing other video genes, in fact the only subjects whose 
responses siqipocted this effect ««ere those who qent a great deal of time 
playing video games (th^ captured many demons) and those vte never had 
played any other video games (they captured the least number of demons). 
However, the vast majority of subjects in this experiment were sonewhere in 
the middle (n»31) in whidi the effect of prior exposure to video g&mes is 
minimal. Since there is no significant correlation betweai nunter of demons 
caught and annoyance response, the Demon game appears to be a suitable 
background task for the study of tenporal effects on anne^ance response. 

To confirm that noise did not affect task performance, nine subjects 
were called back to run in a no-noise condition. An analysis of variance on 
raxiter of demons caught resulted in session length as the only significant 
factor (Table 3). That is, given more time, subjects capture more demons. 
The presence of noise in this experimental paradigm had no significant 
effect on task performance. 

Ihough the results of Experiment 1 support a decay model of annoyance 
level as a fimction of time to response, several adjustments tu the 
experimental procedure are suggested before tests of the form of the 
function may proceed. First, the instructions regarding the annoyance 
response appear to be ambiguous. Ihough they replicate the instructions in 
Powell's and Shepherd's experiments, the annoyance responses made te 
subjects were probably one of two types. One is retrospective in nature and 
the other is truly based on the subject's current emotional response. 

During short time periods, the memory for auditory events is rather good, 
and hence subjects can respond with their annoyance level at the time of the 
noise bursts. Therefore, late presentations are always analyzed at response 
time and memory trace of the noise event is excellent. However, as time 
between event and response time increases, other types of annoyance 
judgments may be made. Some subjects may respond with their currait, 
decayed emotional response (very low) while others might respond with either 
their recollection of their emotional response following the ev^t or their 
memory of the sensory event and their translation of that event into an 
emotional response (much higher). In the latter case, subjects are saying, 
”If I heard a noise that loud, I would be very annoyed." Therefore, he 
reportf a high level of annoyance. In fact, this experim^t relies on these 
distinctions between annoyance responses, as do all experiromits vhidi use 
magnitude estimation procedures. Subjects are asked to compare their 
current annoyance levels with the standard annoyance level, an annoyance 
level they felt in the past. Their recollection of the «^tional response 
following the emotion-provdeing event is excell«it. Otherwise, their 
annoyance responses made to the sixth session would be significantly higher 
than their responses made to the first session. Afterall, the annoyance 
produced \y/ the standard has decayed significantly by the end of the 
experimait. The annoyance responses made by subjects indicate that their 
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TABLE 3 





ANALYSIS OF VARIANCE OBTAINED USING NUMBER OF DEMONS CACX3iT AS 


DEPENDENT MEASURE, 

NOISE VERSUS NO-NOISE CONDITIONS 


SOURCE OF VARIATION 

SS 

DF 

MS 

F 


A (Noise Condition) 

15.5740 

1 

15.5740 

C.25 


S (Subjects) 

4871.3703 

8 

608.9213 



A X S 

484.2593 

8 

60.5324 



B (Session Length) 

54868.4815 

2 

27434.2408 

2675.43** 


B X S 

1702.1852 

16 

10.2541 



A X B 

43.8148 

2 

21.9074 

0.44 


A X B X S 

790.8519 

16 

49.4282 



** indicates significance at 0. 

01 level 
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recollection of the standard ann^ance does not decay significantly with 
time. Therefore# one would expect responses to vary greatly in conditions 
where the time between event and response time is large because the 
distinction between these two types of re^)onses is greater. The results of 
a Wilcoxcn Matched Pairs Signed Ranks Test for the variances of re^x>nses 
for each experimental cell indicate that# indeed# there exists a significant 
difference between variances of responses made for early presentations and 
variances of rei^nses made for late ones (p<.05)# where variances are 
greater for early prc.antations. Since an explanation has beai found to 
account for the non-significant results of Powell's and Sh^)hsrd's 
experiments# i.e. presence of task# more specific instructions regarding the 
time of the euialysis of response are included in Experiment 2. In order ''O 
focus on current annoyance response as opposed to remembered annoyance 
level# subjects are asked to respond to the annoyance level they feel at the 
time of the response# nrt the annoyance level they felt at the time of the 
noise presentation. 

Since this study is comparative and not absolute# it does not set 
criterion for predicting annoyance judgments. The study establishes which 
parameters are suitable to include in criteria. Since the decay parameter 
is specific to the presence or absence of a task# the parameter is estimated 
using a ^>ecific context# i.e. the Demon game. Ihe generalizability of the 
estimate of this parameter remains to be seen# bat this context maximizes 
the decay parameter so that estimates may be made over relatively short time 
periods. Only the parameters of anplitude and decay time are examined in 
the next experiment. By adding other decay times to the annoyance function# 
the form of the annoyance response function may be tested. By viewing the 
burst of noise as an impulse# the responses to bursts of noise after a 
period of time would represent the impulse response function. Since the 
rate of decay may be dependent on amplitude and may be "bottoming out" at 74 
dB(A)# several amplitudes greater than 86 dB(A) are introduced. The 
standard noise should have an amplitude in the center of the range so as to 
avoid "range effects" (Poulton# 1968). Poulton discovered that when the 
physical magnitude of the standard is near the lower end of the range# 
values smaller than ,he standard give a steeper sl(^ than values larger 
than the standard. By placing the standard in the center of the range# this 
effect diminishes. 

Since session length had no significant effect on annoyance response in 
the range of three to nine minutes# session length in Experiment 2 will be 
confounded with decay time. All noise bursts will start after 50 seconds of 
game time and last for 10 seconds. The session length will be equal to the 
decay time plus one minute (the time needed to present the burst of noise) . 
Since experimental design considerations limit the study of number of noise 
bursts at this time# the remaining studies are only concerned with the 
annoyance response function for single bursts of noise. 
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4.0 Method - Eiperiaent 2 

4.1 Subjects 

Mine §s Mere recruited by placing posters in and around the Colunbia 
Ikiiversi^ caopus. Host of the Ss» 8» %«ere male. & sere paid $20.00 for 
participation— $4.00 per hour for a total of 5 hours. All Ss r^rted 
nonaal hearing. 

4.2 Stiauli 


The time between presentation and response, and the aeplitude of a 
10-second noise burst «dthin a gaoe session varied in a 9X5 factorial 
experiment with repeated measures on all factors. All noise bursts occurred 
SO s ec o n ds after tiie start of the game session. The time between tiie 
conclusion of the burst and the end of the game session was 0, 1, 2, 3, 4, 

5, 6, 7 or 8 minutes; and the peak amplitude of the burst was 75, 80, 85, 90 
or 95 dB(A}. In this way, several dec^ times and peak amplitude %iere 
analyzed in a within-subject design which in turn reduces error variances. 
Beca use the nuiher of days a subject is reguired to participate in this 
design is five, it is not nec e ssary to use a cxxifounded design as was the 
case in Experiment 1. 

Bee .use the time at thich the noise burst starts with regaurd to the 
start of the gane session is constant for all trials, the noise bursts need 
not be ramped on and off. These bursts of noise may now be viewed as an 
impulse of noise rather than the sun of impulses which ramp on in 250 nsec, 
to the peak and ramp off in 250 msec, after 10 seccxids. The resulting 
amgyance function may be vi e w e d ais the impulse response function as 
discussed in the introduction. 

Also, since it is no longer necessary to raplicate the procedures of 
Powell and Sh^herd, the standard noise wais presented witliin a game session. 
The standard was a 10-second burst of white noise with pe=»k amplitude of 85 
dB(A) which occurred 50 secxHids frexn the start of a 1-minute game sessicn. 
This cdiange in procedure makes judgments betwe^ the standard and the test 
srssicxns more cxnparadole. Both judgments are new being made in the same 
exmtext. 

4.3 Apparatus 

The audio-control system is represented by the blexdt diagram of Figure 
1 and described in the Method section rf Experiment 1. 

4.4 Procredure 


S was greeted as he entered the lai rate 


Ltsd into the 
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sioulated living zoom, seated on tiie coudi and asked to sign tie 
PsyduCtiysics Lainratocy Subject Informed Consent Form (^^ipendix E). 

Changes Mere n ec e ssary in the ScAiject Informed Consent Form used in 
Expuiment 1 (Appendix B). The t^aninology "For your protection..." tias 
aobiguous enough so that many ^ questioned what th^ wne being protected 
from. For Experiment 2, the reference was omitted and S was given the form 
in Appendix E to sign. 

The major difference in procedure between Experiments 1 and 2 is the 
instructions regarding aixioyanoe re^xxise. Bophasis is made in Experiment 2 
on the annoyance felt at the tir^e that the re^xmse is reported, not on the 
annoymx» felt at the time of the noise events. The instructions are 
contained in Appendices F and G. 

An «q>eriraental session lasted approximately one hour. The 45 
experimental conditions were randomly assigned. The first eiq>eriinental 
session consisted of a practice game session followed fay tiie standard game 
session and several test game sessions totalling between 33 and 40 minutes 
of game time. The practice game session was eliminated for the rmnaining 
four days. The remaining eiqperimental sessions consisted of the standard 
and test sessions totalling between 42 and 53 minutes of game time. S was 
paid a participation fee of $20.00 at the end of the fifth da^. After S was 
paid, E asked S to answer the question: "If the standard annoyance is *T00, ' 
that number represents the annoyance level at thich any nunber you gave 
above that nunber you would label 'annoj'ing* and any nunober you gave below 
that number you would label 'not annoying.*" If S had difficulty 
understanding this question, E r^)eated the questTon slowly (usually with 
hand-motions) until E was satTsfied that S understood that S was asking S 
for his threshold of annoyance. 

5.0 Results and Discussion - Experiment 2 

The results of Experiment 2 indicate that indeed the annoyance response 
function is a decreasing function (Figure 10} . The geometric means of the 
individual annoyance re^xxise functions for each amplitude were fitted by 
means of a standard linear regression analysis to a linear decay function: 

Annoyance Level » AQ(i-at), 

where Aq is dependent on the initial amplitude and is the value of the 
function at t«0 (zero decay time) and a is the linear rate of decay (Figure 
10, dotted lines) . The linear annoyance decay function has this form 
because initial perusal indicated that £ was proportional to the initial 
amplitude Aq, ihe calculated values of the two parameters and the r^-values 
for the linear regressions are presented in Table 4. 

Although the r^-values are significantly different .ram zero (p<.01) 
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TABLE 4 

OOHFARISON BEIWEEN A LINEAR DECAY MODEL AM} AN EXPONENTIAL IXCAY MODEL FOR 

EXPERIMENT 2 



Linear Decay Model: 

Annoyance Re^xxise 

= Ao(l-at) 


r2 

a 

bo 

75 dB(A) 

.747649 

11.625235 

41.153333 

80 dB(A) 

.715724 

10.038335 

54.088222 

85 dB(A) 

.767034 

13.805311 

60.391333 

90 dB(A) 

.586764 

11.097884 

82.098445 

95 dB(A) 

.680984 

9.910625 

117.467393 

Exponential Decay Model 

: Annoyance Re^x>nse = Aq e®*" 


£ 

a 

ho 

75 dB(A) 

.788989 

-.135082 

42.952180 

80 dB(A) 

.786966 

-.175912 

57.836723 

85 dB(A) 

.778537 

-.098288 

60.908009 

90 dB(A) 

.677366 

-.139758 

83.273811 

95 dB(A) 

.747910 

-.148695 

113.676275 
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indicating that the slope of the function is significantly different from 
zero, the cuinoyance responses made between zero and three minutes after 
presentation appear to decay more rapidly than this model pr^icts (see 
Figiires lOD and E). An alternative model «Aich could explain this change in 
the rate of decay is the expon&itial dec?\' ninction: 

Araioyance Level » Aq e^tr 

vhere ^3 dependent on the initial amplitude and is the value of the 
function at t^ and a is the decay constant. Ihe geometric mean re^onses 
are fitted to the exponential decay model in Figure 10 (solid lines) If 
the eiqxxiential decay model provides a better fit to the data than does the 
linear decay model, this analysis «iould resu.^ . in larger r^-vadues than vhen 
a regression to the linear decay model is performed, the re^xxise functions 
for this model are also presented in Table 4. Ihe revalues for these 
functions indicate that the exponential decay model is a better fit to the 
data than is the linear decay model. 

However, the decay constants of the exponential function calculated 
from this experiment do not appear to be systematic. Furthermore, the 
annoyance responses in the first three minutes of deca^ times seem to decay 
faster than the calculated exponential functions predict. The subjects were 
asked at the conclusion of this experiment to define their "threshold" 
value. The nunter of subjects whose araioyance rei^xxise was at or below his 
threshold value is shown in Table 5. Notice that after three minutes, 
particulaurly in the lower anplitude condicions, the majority cf subjects 
were at their threshold values. Subjects r^orted that, in mat^ cases, vhen 
their response was below their threshold value, th^ would have said "zero" 
if it were permissable. They found it very difficult to define a re^onse 
proportional to an annoying standard when they were not annoyed. If only 
the first three minutes of decay times are considered, a consistent pattern 
forms (Table 6). The functions for the amplitudes between 75 cund 85 dB(A) 
have decay constants between -.180 and -.195. At 90 and 95 dB(A) the decay 
constants increase sharply to between -.325 and -.343. The parameters for 
the linear decay model are also included in Table 6 for the first three 
minutes of decay times. The exponential decay furKrtion pr xdicts the results 
better than the linear decay function for three of the five amplitudes (75, 
90 and 95 dB(A)). For 85 dB(A) both functions fit the data well and for 80 
dB(A), neither function predicts the data well. 

Before conclusions can be made regarding the form of the decay function 
from the values of the parameters and the least-squares estimates, it is 
suggested that an additional experim^t be performed that focuses only on 
the first three minutes of decay. Afterall, the conclusions that can be 
mcKie in this experimmit are based on only four data points per amplitude. 

It is suggested that these four values be verified in addition to including 
other decay times within the first three minutes of decay in a supplanentary 
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TABLE 5 

NUMBER OF SUBJECTS NHOSC AIMOYANCE RESPONSE WAS AT OR BELOW HIS THRESHOLD 

VALUE FOR EXPERIMENT 2 (N > 9) 


Decay Tijnes: 012345678 minutes 


75 dB(A) 
80 dB(A) 
85 dB(A) 
90 dB(A) 
95 dB(A) 


2 5 4 
14 4 
2 2 3 
0 2 3 
0 0 2 


4 6 6 

4 3 7 

4 5 3 

5 4 2 

2 3 3 


6 7 7 

4 6 6 

5 4 4 

4 4 7 

4 3 4 
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TABLE 6 

OQMBARISON BE7IMEEN A LINEAR DECAY MODEL AND AN EXPONENTIAL DECAY MODEL FOR 
EXPERIMENT 2 (DECAY RATES BETWEEN 0 AN) 3 MINUTES ONLY) 



Linear Decay Model: 

Annoyance Response 

* Ao(l-at) 


r2 

a 

bo 

75 eB(A) 

.688101 

6.477567 

45.479000 

80 dB(A) 

.462891 

6.540437 

57.824000 

85 dB(A) 

.958233 

7.183345 

68.249000 

90 dB(A) 

.780964 

4.357328 

100.602000 

95 dB(A) 

.969121 

4.483495 

143.712000 

Exponential Decay Model 

: Annoyance Response * Aq 



a 

bo 

75 dB(A) 

.696934 

-.195885 

45.266659 

80 dB(A) 

.416390 

-.133099 

55.711937 

85 dB(A) 

.955532 

••.180655 

69.353365 

90 dB(A) 

.863263 

-.325170 

98.842602 

95 dB(A) 

.993283 

-.343681 

155.499704 
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experiment. 

6.0 Method - Experimait 3 

6.1 Subjects 

Five of the nine Ss Who participated in &^riment 2 continued in this 
ei^riment along with Tive additional Ss. Most of the Ss, 9, were male. Ss 
were paid $10.00 for participation — $?.0o per hour for a total of 2 hours. 
All Ss rqx}rted normal hearing. 

6.2 Stimuli 

the time betwe«i presentation and re^x>nse and the anplitude of a noise 
burst within a game session varied in a 7X5 factorial experiment with 
repeated measures all factors. Ihe time betweai the conclusion of a 
10-second noise burst «)d the response is 0, .5, 1, 1.5, 2, 2.5 or 3 
minutes; and the peak enplitude of the noise burst is 75, 80, 85, 90 or 95 
dB(A). The noise burst occurs 50 seconds after the start of the game 
session. As in Experiment 2, the burst of noise is not ramped on and off. 
The standard is a 10-second burst of vihite noise with peak amplitude of 85 
dB(A) which occurs 50 seconds after the start of a 1-oiirHite game session. 

6.3 Apparatus 

The audio-control system is r^resented the block diagram of Figure 
1 and described in the Method section of Experiment 1. 

6.4 Procedure 


The procedure is the same as Experiment 2 and described in the Method 
section of Experiment 2. 

An experimental session lasted e^roximately one hour. The 35 
experimeital conditions were randomly assigned. Since the game sessions 
were short, more sessions were run per experim«ital hour than in either 
Experiment 1 or 2. The first experimental session consisted of a practice 
game session followed by the standard and several game sessions totalling 
between 37.5 and 40 minutes of game time. The practice game session was 
eliminated for the second day. The experimental session consisted of the 
standard and gene sessions totalling betwe^ 47.5 and 50 minutes of game 
time. S was paid a participation fee of $10.00 at the end of the second 
day. 

7.0 Results and Discussion - Experiment 3 


The results of Experiment 3 concur with the results of the prior 
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experiments. Ttve annoyance response function is a decreasing function 
(Figure 11) . The annoyance response means were fitted to a linear decay 
funrtion and an exponential decay function as was done in Experiment 2. 
Results indicate that for all amplitudes except 80 dB(A) the expon^itial 
decay function (solid lines) provides a better fit to the data than does the 
linear decay function (dotted lines) as eui annoyauice response function 
(Table 7). These results for 80 dD(A) seem to be an anorooly because of the 
last two data points in the function (2.5 and 3.0 minute decay times) . 

Indeed vhen the regression analysis is performed oa the re^xxise function 
for 80 dB(A) without the last two data points, the exponential decay 
function provides a better fit to the data than does the linear decay 
function. The results shown in Table 7 indicate that the three higher 
anplitudes have similar decay constaunts whereas the two lower anplitudes 
have a smaller decay constant. In fact, a test of the significance between 
the decay constants (actually the significance between tho two slopes 
resulting from the linear regression of the log gecroetiic means) indicates 
that, at the .10 level, the decay constants for the re^xsnse functions of 75 
and 80 dB(A) do not differ significantly from each other, nor do the decay 
constants for 85, 90 and 95 dB(A). However, the decay constants for 75 and 
80 dB(A) differ significantly from the decay constants for 85, 90 asid 95 
dB(A). Therefore, it can be concluded that the rate of decay for the higher 
anplitude bursts is significantly greater than for the lower anplitudes. 

The results of this experiment indicate that the function that provides 
a good description of the annoyance response function is: 

Annoyance Response * Aq 

Though the value of the decay constant alpha is dependent on the units of 
time used in its calc ilation, the decay constant has no imnediately obvious 
mecuiing in that its value is unitless. Therefore the half-life is 
introduced to add meaning to the decay constant. This is the length of 
time, At, in which the annoyance response is reduced by 50%. To relate At 
with alpha, consider two time instances, t^ and t 2 » ti + At (see Figure 12) 
and let Aj^ and A2 denote the annoyance response at these time instances, 
respectively. From the formula for the exponential decay function: 

Ai « An e®^l» . 

A 2 » Ao e®*^2« Aq ea(^l'^At)» Aq e®^l * or 

A2 « Ai e®^*- 

Since the definition of half-life assumes that only half of the annoyance 
response remains after the time interval, A 2 ■ ^/2 Ax. This yields: 

e®^ * 1/2, 

aAt - In 1/2 « -0.69315. 
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TABLE 7 

CX>1PARIS0N BETIVjEEN A LINEAR DECAY MODEL AND AN EXPONENTIAL DECAY MODEL FOR 

EXPERIMENT 3 



Linear Decay Model: 

Annoyance Response 

= Aq ( i-at) 



a 

bo 

75 dB(A) 

.590825 

5.037318 

42.615714 

80 dB(A) 

.626572 

5.617140 

60.556786 

85 dB(A) 

.861182 

4.100432 

92.271429 

90 dB(A) 

.874364 

4.166917 

134.939643 

95 dB(A) 

.964670 

4.228163 

171.093214 

Expon^tial Decay Model 

: Annoyance Response * Aq 


£ 

a 

bo 

75 dB(A) 

.683987 

-.251139 

41.396634 

80 dB(A) 

.603558 

-.218984 

59.118162 

85 dB(A) 

.912166 

-.378814 

95.301076 

90 dB(A) 

.949757 

-.374055 

140.420170 

95 dB(A) 

.977339 

-.384411 

182.667329 
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Hencer the half-life is 


At - 0.69315/-a. 

Therefore vrtienever a time interval of length At elapses, the annoyance 
response is reduced by one-half. Sin*^ the half-life is a linear transform 
of the decay constant, conclusions made regarding the decay constants in 
Experimaits 2 ari 3 are also true for half-life estimates, i.e., the 
half-life for the higher anplitude bursts is significantly shorter than for 
the lower acplitude bursts (Table 8) . Hie results for 0 to 3 minute decay 
times for Experiments 2 and 3 indicate that the higher amplitudes have a 
nalf-iife between 1.8 and 2,1 minutes and the lower amplitudes have a 
half-life between 2.7 and 3.8 minutes. 

8.0 Gaieral Discussion 


Hiis series of experiments demonstrates that annc^ance decays as time 
progresses from the occurrence of a noise burst and that the form of the 
decay is best approximated as an exponenvial. Hie purpose of this research 
was to define v^ich parameters were appropriate for defining an annoyance 
rei^nse function. Once the important parameters of a model were defined, 
an examination of the value of those parameters for specific stimuli within 
a particular context were calculated. Past researdi indicated that maiiber, 
duration and amplitude were three important pzurameters in the determination 
of cotmiunity annoyance response to aircraft rxiise. Usually these parameters 
are examined by researchers in either a ccmnnunity setting in whidi residents 
are asked to .ategorize their annoyance produced by noise exposure or in a 
laboratory setting in which subjects are asked to categorize or magnitude 
estimate their annoyance. The design of these experimaits fail to take into 
account the dynamic envirorroent, that is, how each noise evait contributes 
to the individual's annoyance level. Hie models most commonly proposed by 
researchers lack a temporal parameter. Before an examination of the 
annoyance response function for individual noise events could proceed, it 
was necessary to determine whether the inclusion of a temporal parameter in 
the annoyance response function would significantly reduce the amount of 
varirince atout this function. When other attempts had been made to examine 
the effect of presentation time on annoyance response, researchers failed to 
confirm any systematic time-effects (Powell, Note 6; Shepherd, 1981). 
Therefore, it was first necessary to determine why other researdiers had not 
found what intuitively seemed to be an important parameter in the 
determination of annoyance response. 

An examination of the techniques used by Powell nd Sh^iherd in their 
research indicated that subjects in these experiments, as is the case in 
most experiments which test models of emotional response, were asked to 
attend to the emotion-provoking stimuli exclusively or while engaged in 
passive activities. It was hypothesized that this lack of activity did not 
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TABLE 8 

HALF-LIFE ESTIMATES OF ANNOYANCE RESPONSE BASH) ON THE DECAY 
CONSTANTS CALCULATED FOR EXPERIMENTS 2 AND 3 
(0-3 MINOTE DECAY TIMES ONLY) 


Experiment 2 (Minutes) Experiment 3 (Minutes) 


75 dB(A) 

3.53856 

2.76002 

80 dB(A) 

3.78565 

3.16531 

85 dB(A) 

3.83263 

1.82979 

90 dB(A) 

2.13166 

1.85307 

95 dB(A) 

2.01684 

1.80315 
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encourage "reported" annoyance to decay. As we know in real-life as noise 
occurs around us. we are not attending to the noise exclusively. He are 
engaged in activities that range from being ^traoely passive to bein^ 
extremely active. Ihe amount of involvement in the activities that interest 
us should affect the amount of attention we allocate to the noise events. 

The reqi^t of subjects to attend to noise exclusively inplies a sort of 
accounting procedure %^re subjects hear a noise* evaluate the noise and 
store this information for future reference. The ability to store this 
information and the ability to recall it can be affected by the presence of 
a secondary tadc. Therefore* if there »ists a tenporal parameter* the 
value of the parameter should be dependent on the context of the ei^>eriment. 
In the first experiment* it was necessary to establi^ the effect of 
attention level on the tenoporal parameter as well as to determine the 
gereral form of the annoyance response function. 

Results of Experiment 1 indicate that the tetoporal parameter does 
inprove the fit of the data to the arxxxycuxre response function in a d/namic 
envirormait and that the general form of the annoyauxs response function 
which includes a tenporal parameter is a decreasing function* i.e.* the 
longer the time to response* the more the annoyance re^xonse is reduced. 

When the annoyance re^xonses made by subjects involved in a conpensatory 
tradcing task are coopared to those made by subjects involved in reading a 
nagazine* the rate of decay increases as the activity level of the context 
increases. 

The parameters considered in Experiment 1 were the session length* the 
mznber of noii.e bursts* the anplitude of the noise bursts and the time of 
presmtation of these r^ise bursts. The results of an analysis of variance 
indicate that all the parameters excluding session length are inportant 
parameters in defining the annoyance re^xonse. Session length may be an 
inportant peuraroeter in the determination of annoyetnce re^xxnse outside the 
range of values used in this experiment. However* within the rcoige of three 
to nine minutes* it is not. Of the models considered (peak dB(A), energy 
averaging, energy sumoation and annoyance decay)* the annoyance d^ay mcxdel 
is the only model that has a tenporal parameter. Therefore* the results of 
Experiment 1 support a decay model of annoyance re^x>nse. 

Once the respoa'e function was determined, estimates of the tenporal 
parameter could proceed. Annoyance judgments made for single noise bursts 
wich various times to response could define the annoyance response fixiction. 
The annoyance response function for Experiment 2 indicates that annoyance 
appears to decay expot aitially and then level off or decrease more slowly 
after about three minutes. This decrease in the rate of decay after three 
minutes is attributable to the annoyance level falling below each subject's 
threshold for annoyance. At this point, subjects drift more slowly downward 
in their responses, in that once they are no longer annoyed, it is difficult 
for them to respond with an appr<priate annoyance response. 
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Experiment 3 proceetSed in the sene manner as Ei^riment 2 but 
considered only decay times between zero and three minutes with more 
divisions between those two decay times considered. The results of this 
experiment concur with the results of Experiment 2 for the same presentation 
time interval. 

Since the determination of decay constants for Bjqperiments 2 and 3 
provide no obviously intuitive information regarding the rate of decays the 
half-life for these functions is calculated. This is the time necessary for 
the annoyance response to decrease by one-half. Results indicate that 
within 1.8 to 3.8 minutes, the anncyance re^wnse for all amplitudes 
decreases by on e h alf. The higher anplitudes decay more r^idly (half-life 
between 1.8 and 2.1 minutes) than the lower amplitudes (half-life between 
2.7 and 3.8 minutes). 

This experimmit provides both viable results and methodological tools 
for continued research on the effect of noise aivircxxnents cqpon the 
emotional environnait of individuals. First, if one is interested in ^K>rt 
time periods of noise, the results of these experimmits may generalize to 
other environments provided the attention level of individuals is 
approximated by the attention level of the subjects in this experiment to 
the noise bursts. Secondly, if one is interested in other noise sources or 
other contexts, the procedures established here are certainly reasonable for 
many other stimuli and contexts. Also, if one is interested in multiple 
noise sources, these experiments establish a foundation for this research 
effort c 

Further investigations might attempt to focus more intensely on the 
cause of the decay in annoyance response. The cause of the decay whether 
the decay of annoyance response is attributable to a decrease in the memory 
strength for the sourcre of annoyance and therefore the eissociated euinoyance 
re^»nse is reduced as time passes, or whether the decay is attributable to 
the cognitive interaction on the decaying physiological response is not 
addressed in this research. 

It is established in these experiments that the annoyance re^)onse 
function for i.<’ividual noise bursts as a funcrtion of time is a decreasing 
function which can be best described as an exponential decay function of the 
form: 


Arjioyanc» Response * Aq 

where Aq is the annoyance response at t*0 and a is the decay cxjnstant. The 
value of the decay constant appears to be d^)endent on the level of activity 
within the context of the experiment (the more active the task, the greater 
the rate of decay) and on the amplitude of the noise burst (the greater the 
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anplitucle the greater the rate of decay) . 
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SCALING LOUDNESS DIFFERENCES BY INTRA-MODAL MATCHING: 
EVIDENCE FOR A SINGLE SCALE UNDERLYING LOUDNESS AND 

LOUDNESS DIFFERENCE 


Richard Dorian Popper 


The perception of loudness was studied in four 
experiments. In Experiments 1 and 2 (difference 
matching ) , subjects judged the magnitude of loudness 
differences by adjusting the loudness of a single tone to 
equal the loudness difference between a pair of tones. 
Seventy-two tone pairs were used, constructed from nine 
sound pressure levels of a 1000 Hz tone ranging from 46-94 
dB (Experiment 1) or 55-95 dB (Experiment 2). In 
Experiment 3 (difference estimation), subjects magnitude- 
estimated the loudness differences of the same tone pairs 
as in Experiment 2, and in Experiment 4 they magnitude- 
estimated the loudness of the individual tones used to 
construct the tone pairs. The same subjects served in 
Experiments 2-4, and binaural listening through earphones 
was used throughout. 
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Difftrence matches and difference estimates were 
analyzed nonmetrically to derive underlying scales of 
loudness. For both kinds of judgments, loudness grew as a 
power function of sound pressure, although in Experiment 1 
some departure from the power function was evident at low 
intensities. 

A comparison of the exponents for Experiments 2 and 3 
showed that for the majority of subjects, difference 
matching produced larger exponents than difference 
estimation. The difference matching exponents were in the 
vicinity of the sone scale exponent for loudness, whereas 
the difference estimation exponents were smaller. 

Further analysis showed that the scale underlying the 
difference matches was similar to the loudness scale for 
the tones used to do the matching and' for the single tones 
magnitude-estimated in Experiment 4. This suggests that 
the same scale can underlie both judgments of loudness and 
loudness difference, a hitherto unobserved phenomenon. 
Possible sources for the disagreement between difference 
matching and estimation are discussed, as are the 
individual differences observed in the performance of each 


task 
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INTRODUCTION 


Demonstrations of the simultaneous and independent 
operation of two scales of loudness (Marks, 1979a) argue in 
favor of an heirarchical model of judgment, in which 
separate scales reflect separate processing stages. 
Consider therefore an experiment in which two tones are 
played and the subject is instructed to adjust the loudness 
of a third tone to equal the size of the loudness 
difference. This task involves the matching of a loudness 
difference and might therefore involve the use of both 
sensory and difference scales, as Marks calls them. 
Formally, the matching task can be represented as follows: 


I 


j 



( 1 ) 


I- is the louder tone in the pair, and I is the single 

J iU 

tone whose loudness is used by the subject to match the 
size of the loudness difference. The exponent alpha 
governs the perceptioti of loudness differences, the 
exponent beta the perception of loudness. 

The possibility of matching che loudness of a single 
tone to a loudness difference (difference matching) was 
alref.dy conceived by Fechner, who wrote, "given two tones 
of differing physical intensity, it is possible to imagine 
a third tone, whose intensity equals the difference in 
intensity between them" (1860. I, 48). Stumpf, on the other 
hand, claimed that it was impossible to subtract one 
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sensation from another and "to feel the remainder by 
itself" (quoted in James, 1890, I, p« 547). Russell 
quipped: "A change of length is Itself a length, but a 
change of temperature or illumination is not itself hot or 
bright ... With intensive quantities ... these differences 
in quantity are not themselves quantities. The difference 
between two intensive quantities, in fact, differs from 
each as much as the difference between two horses differs 
from a horse" (quoted in Titchener, 1905, p. Ixxxiii). 
However, this a priori argument is nothing but a 
restatement of the "quantity objection." Whether a 
measurement scale can be formed on the basis of such a 
task is subject to empirical investigation. 

The procedure of difference matching can be compared 
to the task of numerically estimating the size of a 
loudness difference, represented as 

Ij ^ (2) 

In Equation 2, N represents the number reported by the 
subject, the exponent gamma governs the perception of 
loudness differences, and the exponent delta represents 
any nonlinearity in the subject's use of numbers. 

Several predictions can be made concerning the 
relationships between the exponents in the above 
equations. First, note that the exponents alpha and gamma 
govern how loudness grows when loudness differences are 
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being judged. The most obvious prediction concerning 
these two exponents is that they will be the same. That 
is, the expectation is that the same power function is 
recovered, regardless of whether the method of judgment is 
difference matching or estimation. Furthermore, since 
loudness differences are being judged, these exponents 
should full in the neighborhood of previous estimates of 
the interval scale exponent for loudness, nai.ely around 
0.30. On the other hand, differences between alpha and 
gamma can result from either perceptual differences 
associated with the two procedures or response bias 
factors. 

A second prediction concerns the exponent beta in 
Equation 1, which reflects the growth of the loudness of 
the single tone used to match the loudness difference. 
One prediction for this exponent is that it should 
approximate the exponent for sensation magnitude, since 
loudness, not loudness difference is involved. Thus, beta 
is expected to be in the neighborhood of 0.60, the 
exponent for the sone scale of loudness. However, it is 
possible to interpret the difference matching t.^sk as 
involving two loudness differences, one between the two 
tones in the pair, the other between the loudness of the 
single tone and "zero" loudness. According to this 
interpretation, subjects are engaged in matching one kind 
of loudness difference to another, instead of matching a 
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loudness magnitude to a lordness difference. The second 
interpretation predicts that beta will equal the 
difference exponent for loudness, not the exponent for 
sensation magnitude. 

One reason for doubting this outcome derives from the 
result.^ ~n the bisection of loudness (Stevens, 1975). 
Loudness bisection generally leads to scales similar to 
those obtained with other interval scaling procedures. 
However, when a single tone is presented for bisection 
(with subjective zero as the lower endpoint), subjects 
switch from judgi. j intervals to judging ratios and 
produce a tone that is half as loud as the single tone. 
Such 'bisections* are in fact fractionations and imply 
that the single tone is judged on the scale of sensory 
magnitude, not sensory difference. On the other hand, in 
the context of magnitude estimating a series of loudness 
difierences. Narks (1979a) found that subjects judged the 
difference between a tone and a blank stimulus on a scale 
approximating the difference scale. Thus, it seems 
possible that the single tone could be judged on the 
difference scale, instead of th ^ magnitude scale, in which 
case beta would equal 0.30. 

The present research will address several questions. 
Is the exponent for loudness difference the same fcr 
difference matching and difference estimation? Do the 
data from difference matching support the duallstic 
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position that there are two loudness scales, one for 
differences and the other for magnitudes? If a single 
scale can be found that underlies both kinds of judgment, 
how does this scale compare to that obtained from the 
magnitude estimation of Icudness ratios? 

In the practical realm, the utility of both 
difference and magnitude scales has been demonstrrted. 
The sone scale for loudness underlies several schemes for 
the calculation of the loudness of complex sounds 
including noise (Kryter, 1970), whereas the Nunsell scale 
of light reflectance, a category scale, has been applied 
to printing and photography (Newhall, Nickerson & Judd, 
1943). A dualistic viewpoint allows for the peaceful 
coexistence of these two types of scales. A monistic 
theory, on the other hand, is troubled by the need to 
account for the utility of the "biased” scale. 

From the point of view of theory, the persistent 
discrepancies between scales of sensory magnitude and 
sensory difference have proved to be a continued source of 
concern. With such divergent results, one is less 
confident that any psychophysical scale succeeds in 
measuring sensation at all. Marks has responded by making 
a fundamental distinction between the perception of 
magnitude and the perception of difference. The present 
research will entail a closer examination of the 
"psychological reality" of this distinction. The 



investigation of these issues in the psychophysical realm 
will make possible more informed applications of the 
psychophysical methods of magnitude and difference scaling 
to the measruement of nonsensory attributes. 

The proposed experiments entail studying the 
perception of loudness differences using the novel task of 
difference matching. The operation of differencing, as 
the early history of psychophysics demonstrates, has often 
been viewed as the central mechanism for establishing 
psychological relationships. The new method for studying 
this task promises to provide further insight into the 
nature of difference perception. 

Four experiments are reported here. Experiments 1 
and 2 involve the task of difference matching (see Eq. 1) 
and differ only in the stimulus set. Experiment 3 
involves the magnitude estimation of loudness differences 
(see Eq. 2), and Experiment 4 the magnitude estimation of 
the loudness ratios of single tones. The same subjects 
participated in Experiments 2>4. Experiment 4 yields an 
estimate of the ratio scaling exponent for these subjects 
and serves as a "control* to aid in the interpretation of 
the other exponents. 

Several analytic strategies are available for fitting 
Equations 1 and 2 to the data. One procedure is to 
estimate the two parameters in each equation 
simultaneously. However, with this approach it is 



impossibl* to evaluate separately the appropriateness of a 

difference model and the form of the psychophysical 

function. By using nonmetric scaling , it becomes feasible 

first to assess the extent to which data from both 

difference matching and estimation can be described by 

difference structures. Furthermore, since nonmetric 

scaling makes weak assumptions about the functional 

relationship between the responses and the ”true* 

differences, the comparison between the exponents alpha 
% 

and gamma will be free of the influence of monotonic 
biases that may be present in either difference estimation 
or matching. 

Loudness differences have been scaled nonmetrically 
before (see Schneider, 1982, for review). The scaling 
technique originated with the work of Shepard (1962a, b), 
who showed that with a sufficient number of stimuli the 
ordinal ranks of the interstimulus differences are 
sufficient for a recovery of interval scales of 

measurement (i.e. a scale unique up to multiplication and 
addition by arbitrary constants) . Following Kruskal 
(1964), the observed differences are interpreted as 
monotonic with interstimulus distance (Euclidean or 
otherwise) in an n-dimensional stimulus space. On the 
assumption that loudness is unidimensional, the scaling 
problem reduces to one of finding points on a line, whose 
interpoint distances best match, in an ordinal sense, the 
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loudness differences. This is accomplished on a computer 
using an iterative algorithr. 

The nonmetric scal'ng of loudness differences and the 
estimation of the power function exponents alpha and gamma 
completes only the first stage of the analysis. In the 
second stage, the estimated loudness differences are used 


to derive estimates of beta 

(Equation 

1) 

and 

delta 

(Equation 2) . 

For this purpose, the 

full metric 

information in 

the responses 

i3 needed. 

It 

may 

seem 

inconsistent to 

neglect this 

information 

in 

the 

first 


stage, but subsequently to introduce it in the second. 
However, a nonmetric analysis is preferred in the first 
stage for the reasons cited above. The present approach 
is consistent with that employed by others (Birnbaum, 
1982; Marks 1978, 1979a; Rule 6 Curtis, 1982) who have 
sought to differentiate between the "input” function, such 
as the scale for loudness difference, and the "output" 
function, such as the scale governing the use of numbers, 
or, in the case of difference matching, the scale for the 
single tone. 
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METHOD 


Experiments 1 and 2 
Subjects: 

Three subjects participated in Experiment 1; 
subsequently r two of these (CJ and NH) and an additional 
five subjects participated in Experiment 2. Among the 
five additional subjects, two (MR and MP) had previously 
taken part in Experiment 3. Of these eight subjects, four 
were paid for their participation, either as work-study 
students or as undergraduate recruits. The remaining 
subjects, including three graduate students in psychology, 
were not paid. Subjects' ages ranged from 21 to 34. All 
subjects reported having normal hearing. 

Apparatus end Procedure: 

The apparatus was configured as shown in Figure 1. A 
1000 Hz tone, generated by a Hewlett-Packard oscillator 
(Model 200 CD), was fed to Relay 1 (Rl). The 
normally-open side was connected to an electronic switch 
(Grason-Stadler Model 829E) . The normally-closed side was 
passed through a subject-controlled attenuator and from 
there, through the normally-closed side of Relay 2 (R2) , 
to the electronic switch. The switch was triggered 
externally and gated the signal with a rise-decay time of 
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10 msec. The output of the switch was passed to a 
programmable attenuator , and from there to the subject's 
earphones (TDH 39, 300 ohms, mounted in NX 41/AR 
cushions) . Listening was binaural throughout the 
experiments. 

All process control operations were performed by an 
AOS 1800E computer. These included the timing of stimulus 
durations and inter stimulus intervals, opening and closing 
Relays 1 and 2, triggering the electronic switch, and 
controlling the programmable attenuator. 

The subject-controlled attenuator was a ”sone 
potentiometer" consisting of two 2000-ohm variable 
resistors ganged and cascaded (Stevens & Guirao, 1964). 
By rotating a knob through approximately 270 degress, the 
subject could vary the attenuation continuously over a 
range of about 77 dB. The change in attenuation with 
rotation is plotted in Figure 2. The knob was unmarked 
and no numeric scale was indicated. 

The subject was provided with three keys. Pressing 
and releasing Key 1 resulted in the opening of Relays 1 
and 2 and the presentation of two 850 msec tone bursts 
separated by a silent Interval of 500 msec. With Relays 1 
and 2 open, the subject-controlled attenuator was 
completely decoupled from th<; signal path, and the level 
of each burst at the subject's earphones was controlled by 
the setting of the programmable attenuator. Relay 
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switching and changes in attenuation were accomplished 
without audible transients in the silent intervals 
surrounding the tone bursts. After the second tone bursty 
both relays were closed. 

Pressing Key 2 resulted in the presentation of a 
single continuous tone that remained on for as long as the 
key was depressed. During its presentation. Relays 1 and 
2 were in their normally closed position, passing the 
signal through the subject-controlled attenuator. 
Therefore, the level of the continuous tone was under the 
subject's control. 

For the duration of the single tone, the programmable 
attenuator was set at one of two values, separated by 10 
dB, Alternate values were chosen on successive trials. 
This served to reduce the association between the position 
of the knob on the subject's attenuator and the signal 
level at the earphones, and also prevented the subject's 
setting during the previous trial from being carried over 
to the present one. 

The subject was allowed to sample between Keys 1 and 
2 at will. A dead time following the presentation of 
either the single tone or the tone pair resulted in a 
minimum temporal separation of 500 msec between them. By 
pressing Key 3, the subject signalled the experimenter 
that he had completed the trial. To mark the intertrial 
interval, a light was turned on at the subject's station 
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and left on until extinguished by the experimenter at the 
beginning of the next trial. 

During the intertrial interval, the subject's setting 
of the single tone was determined by measuring the signal 
voltage at the output of the subject-controlled attenuator 
with a Hewlett-Packard digital voltmeter (Model 3476 B) . 
At low levels, these measurements were taken after passing 
the voltage through a Hewlett-Packard line amplifier 
(Model 450 AR) with a maximum gain of 40 dB. 

Tone frequency was set using a Hewlett-Packard 
electronic counter (Model 521 C) and confirmed 
occasionally between experimental sessions. The earphones 
were calibrated prior to the first experiment using a 6 cc 
MBS earphone coupler and a Bruel & Kjaer sound level meter 
(Model 2203) with octave band filter set (Model 1613). 
The oscillographic traces of the signals transmitted by 
the earphones were found to be free of clipping. 

In addition, a calibration was performed that related 
the signal voltage at the output of the subject's 
attenuator to the sound pressure level at the earphones. 
A log unit change in attenuation produced a log unit 
change in sound pressure over the entire range of 
measurable sound levels (down to approximately 30 dB) . 
Therefore, the voltages measured during the experiments 
were later converted to dB SPL by reference to a fixed 
voltage and its corresponding sound pressure level. 



The maximum adjustable level of the single tone was 
107 dB or 97 dB SPL, depending on the setting of the 
programmable attenuator on a given trial. The 
corresponding minimum levels were 30 and 20 dB SPL. By 
increasing the setting of the programmable attenuator, the 
experimenter ~ upon the subject's request — • could shift 
the dynamic range of the sone potentiometer downward. 
This enabled the subject to adjust the level of the single 
tone as low as desired. 

At the beginning and end of each experimental 
session, the voltages at the input to the earphones were 
checked a\. the highest and lowest levels used to construct 
the tone pairs. In addition, the sound pressure level 
produced by a middle setting of the sone potentiometer was 
determined. These level checks revealed only minimal 
drift over the course of an experimental session. 

The subjects were seated in an Industrial Acoustics 
sound-attenuatlng chamber (Model 1204) . In order to 
familiarize them with the apparatus, a warm-up task was 
presented at the beginning of the first session. (Subjects 
in Experiment 1 had performed this task during a pilot 
version of the experiment and did not repeat it). This 
task consisted of making the loudness of the tone on Key 2 
match the loudness of a single tone presented on Key 1. 
Four such matches were performed, one of which required 
subjects to request a lowering of the range on the 
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potentiometer. Communication during the experiment was 
conducted via intercom. 

The subjects were then instructed in the experiment 

proper. (A complete set of instructions for all 

experiments is contained in Appendix 1) . They were 

informed that Key 1 would deliver a pair of tones 

differing in loudness, whereas Key 2 would deliver a 

single tone whose loudness they could control with the 

potentiometer.. The instructions continued cs follows: 

"Your task in this experiment is to tell me how 
different in loudness the two tones in the pair are. 
You do this by adjusting the level of the single 
tone. In particular, adjust the level of the single 
tone to equal the loudness difference between the 
tones in the pair. That is, make the loud..ess of the 
single tone equal the difference that results from 
subtracting the softer tone in the pair from the 
louder one. If the difference is small, make the 
single tone soft. If the difference is large, make 
it loud. In other words, make the loudness of the 
single tone equal the size of the loudness 
difference. • 

Subjects were encouraged to arrive at their settings by 
bracketing. In addition, subjects were told they were 
free to sample the keys in any sequence and that they 
could request a lowering of the potentiometer range on any 
given trial. 

Nine levels of the ICCC IIz tone w«si« used co generate 
36 pairs of unequal tones for presentation on Key 1, The 
levels were different for Experiments 1 and 2 and are 
listed in Table 1. Subjects in each experiment 


participated in four sessions. Within a session, each of 
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TABLE 1 

Levels of the 1000 Hz tone used In Experiments 1-4 


Experiment 1 

(dB SPL 


Experiments 2/3^4 
re 20 ybcPa) 


46 

55 

52 

60 

58 

65 

64 

70 

70 

75 

76 

80 

82 

85 

88 

90 

94 

95 



the 36 pairs occurred twice , with the order of the tones 
within the pair counterbalanced. The 72 pairs were 
presented in random order during a session. Sessions 
lasted 70-90 minutes, including at least one 10 minute 
break after the first half hour. 


Experiment 3 
Subjects: 

The same 7 subjects who participated in Experiment 2 
participated in Experiment 3. Of these, subjects MP and 
MR performed Experiment 3 before performing Experiment 2. 


Apparatus and Procedure: 

The apparatus was the same as that for Experiment 2, 

except that the sone potentiometer was not used. The two 

tones in the pair were presented in the same way as in 

Experiment 2, but the computer program was modified to 

disable all but one of the three response keys. 

Subjects were instructed as follows: 

"In this experiment you will be listening to pairs of 
tones. The tones in each pair will differ in 
loudness. Your task is to decide how different in 
loudness the two tones are and to assign a number to 
that difference. You will first hear a pair of tones 
whose difference we wHl assign the r.. aber *60', to 
give us a starting p< t. For any subsequent pair, 
if the tones in the (.• .a > sound twice as different as 
did those in the first pair, assign ic the number 
*120*. If in some pair the tones sound half as 
different as did those in the first pair, assign it 
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the nuaiber *30*. You may use any positive number you 
like integer, fraction or decimal. You may not 
use negative numbers or zero. * 

Subjects were told that a key press would deliver the tone 

pair and that they were free to listen to a pair 

repeatedly. Each subject spoke his response over the 

intercom and then proceeded to the next trial. 

The sound levels used to construct the tone pairs 

were the same as in Experiment 2. Each subject 

participated in four sessions, designed as in Experiments 

1 and 2. Preceding the presentation of the 72 pairs was 

the standard pair, which was identified as the pair whose 

difference was equal to *60'. This standard pair was made 

up of tones with sound pressure levels of 65 and 80 dB. 

The order of these tones in the pair was alternated across 

sessions for each subject, and their order in the first 

session counterbalanced across subjects. Although the 

standard pair reoccurred in the series of 72 pairs, it was 

identified by the experimenter as the standard only at the 

beginning of the session. Sessions lasted 45~€0 minutes, 

including a short break at the half-way point. 

Experiment 4 


Subjects: 

The same subjects as in Experiments 2 and 3 
participated in this experiment. Experiment 4 was 



performed last by all subjects 


Apparatus and Procedure: 

The apparatus was the same as in Experiment 3. 
However# the computer program was modified for the 
presentation of a series of single tones. 

Subjects were instructed as follows: 

"In this experiment you will be listening single 
tones of different loudness. Your task is to tell me 
how loud each tone seems to you by assigning a number 
to its loudness. The first tone you hear we will 
assign the number *100', to give us a starting point. 
For any subsequent tone# if the tone seems twice as 
loud as the first tone# assign it the number *200*. 
If a tone seems half as loud as the first tone# 
assign it the number *50*. You may use any positive 
number you like — integer# fraction or decimal. You 
may not use negative numbers or zero." 

Subjects were informed that a key press would deliver the 

tone# but that they could listen to each tone only once. 

After speaking their response# subjects went on to the 

next trial. 

The nine sound levels used to construct the tone 
pairs in Experiments 2 and 3 were presented in 10 
successive sets. Within a set# the presentation order of 
the nine tones was random# with the constraint that the 
first tone in a set could not be identical to the last 
tone of the previous set. Each subject participated in 
only one session# which lasted 20-30 minutes. The 
standard was a tone of 75 dB SPL and was played at the 
beginning of the session. Subsequent occurrences of the 
standard in the series of 90 tones were not identified. 
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RESULTS 

Experiment 1 

Each subject's adjustments of the level of the single 
tone were converted to decibels of sound pressure and 
averaged arithmetically across the tone pairs (i,j) and 
(j,i) and the last three sessions. Data from the first 
session were discarded. The 36 average difference matches 
per subject are listed in Appendix 2. In order to assess 
be tween-session reliability, Kendall's coefficient of 
concordance, W, was computed from the ranks of the 
wlthin-session averages (of pairs i,j and j,i). The 
values of W were 0.94, 0.94, and 0.97 for subjects BL, CJ 
and NH respectively, indicating good agreement across 
sessions. 

Each subject's average difference matches were 
submitted to TORSCA-9 (Young, 1968) for nonmetric scaling. 
This scaling routine consists of two separate Iqorithms. 
The first prepares an initial configurac using 
factor-analytic methods and is *semi-metcic" in that 
multiplications and additions are performed on the input 
data. The second algorithm is fully nonmetric and starts 
with the configuration derived by the first part. On a 
given iteration, the points corresponding to each stimulus 
are moved in search for the best monotonic fit between the 
interpoint distances and the average decibels. The degree 
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o£ fit is indexed by a measure called stress (Kruskal, 

1964), usually expressed as a percentage. A perfect fit 

has a stress of zero and exists when the interpoint 

distances are perfectly monotonic with the raw 

differences. However, with fallible data there will 

usually be some violation of monotonicity. Stress 
reflects the extent to which the interpoint distances 
deviate from this perfect monotonic relationship and 
therefore resembles the residual sums of squares in 
regression analysis. As an index of fit, stress also 
provides a criterion for determining when the best 
possible solution has been obtained. The iterative 

process terminates when further changes in the interpoint 
distances produce only insignificant reductions in stress. 

One difficulty with iterative scaling algorithms is 
the possibility of terminating the iterative search at a 
local minimum. The risk of such nonoptimal solutions is 
greater in the one-dimensional than the multidimensional 
case, because points cannot easily move past each other 
during iterative computation if they lie on a line 
(Kruskal, Young & Seery, 1973). TORSCA-9 was chosen as a 
scaling routine, since it is known to be relatively robust 
against local minimum problems (Spence, 1970, 1972). 

The final stress values for subjects BL, CJ, and NH 
were 5.8%, 8.0% and 2.2%. A means of evaluating the 

quality of the obtained solution is provided by Young’s 
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(1970) index o£ metric determinacy, M, which is defined as 
the squared correlation between the true interpoint 
distances and the distances recovered by the nonmetric 
analysis. The index can be interpreted as the proportion 
of variance in the true distances which is accounted for 
by the recovered distances. Sin^e the true distances are 
unknown, M can only be estimated. Young provides a 
nomogram for estimating N on the basis of the number of 
stimuli, the number of dimensions, and stress. For BL, CJ 
and MB the estimates of M were 0.98, 0.97 and 0.99, 

respectively. 

The principal output of TORSCA consists of scale 
values for the nine stimulus intensities, which are 
contained in Appendix 3. Insofar as the nonmetric scaling 
results are characterized by low stress and high metric 
recovery, these scale values represent interval-scale 
measures of loudness. That is, L^«aTx+b, where Li is the 
loudness of stimulus i, Tj^ is its TORSCA value, and a and 
b are arbitrary constants. A power function relationship 
between loudness and stimulus intensity implies that 
L»aT+b«kin, Equivalently, T+(b/a) »(k/a) I”, or T+u«vin, 
Taking logarithms on both sides yields log(T-i-u)> 
nlog I+log v, which says that for some value u, the 
logarithm of (T+u) is a linear function of stimulus 
Intensity in decibels. The value u is a free parameter 
and was chosen to maximize the squared correlation (r^) 



between logCT-t-u) and decibels. This was accomplished by 
computing r2 while varying u in small steps over a range 
of values. The value of u for which r2 is maximum is 
designated u*» and P* [»T+u*l multiplied by 100 for 
convenience in plotting is called a loudness projection . 

In Figures 3-5, the loudness projections are plotted 
on a logarithmic axis as a function of sound pressure 
level in decibels for each subject. In these coordinates, 
a power function relationship appears as a straight line. 
The exponent of the power function is equal to the slope 
of the line and can be estimated by least-squares methods. 
For subjects BL, CJ and MH the exponents (slopes) were 
0.66, 0.46 and 0.55, and the values of r^ 0.966, 0.996, 
0.961. These exponents estimate alpha in Equation 1 
rr <X OK A 

lij -li *Im” ]• Both subjects BL and NH show a 
systematic deviation from a power function at the lowest 
stimulus levels, which is considered in more detail below. 

The objective of this experiment was to determine two 
loudness function exponents: the exponent alpha (estimated 
above) underlying the perception of loudness difference 
and the exponent beta governing the loudness growth of the 
single tone adjusted by the subjects. Taking logarithms 
on both sides of Equation 1 yields log(Ij^ -Ii^)“/3(log 
^m) t which states that the logarithm of the (estimated) 
loudness differences is linearly related to the decibel 
settings produced by the subject, with a slope equal to 
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beta. The values of alpha from above can be used to 
estimate (Ij^ ) » ««d a linear regression of log In on 
log(Ij^-Ij^) provides a least-squares estimate of 
1/beta, from which beta can be determined by taking the 
inverse. The values of beta for subjects BL, CJ and NH 
were 0.82, 0.53 and 0.47, the corresponding r2's 0.931, 
0.893 and 0.956. The functions are shown in Figures 6-8. 

In these Figures, deviations from the best fitting 
power function are again visible in the form of a 
flattening of the function towards the lower end. This 
suggests that subjects are responding similarly (producing 
similar decibel settings) to a range of small loudness 
differences. The curvature is consistent w'.th evidence 
from Figures 3 and 5, which showed that, for subjects BL 
and NH, the adjusted loudness projections of the bottom 
three stimuli were virtually the same. Clearly, these 
subjects did not assign a distinct loudness to the three 
lowest intensities. Indeed, all three subjects reported 
difficulty in telling the louder from the softer tone when 
the tone pair was comprised of the two bottom stimuli. 

These departures from a simple power function make 
the estimation of exponents in Eq. 1 somewhat problematic. 
For example, the curvature of the plot in Figure 6 reduces 
the slope (1//?) of the best-fitting function, thereby 
leading to an overestimate of beta for subject BL. Also, 
for subjects BL and NH, the exponent alpha is clearly not 
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reprasentative ct the growth o£ loudness over the entire 
stimulus range. Furthermore, confusions among a subset of 
stimuli can influence the outcome of the nonmetric scaling 
in a less obvious way. Thus, while subject CJ's plot of 
loudness projections versus decibels accords well with a 
power function, confusions among the low level stimuli 
could well ha' e produced a reduction in inter stimulus 
spacing, since there would be a tendency for these stimuli 
to be clustered together. This would result in depressing 
the estimate of alpha. 

Thus, Experimant 1 represents only a partial success. 
Subjects appear able to perform the task of difference 
matching and produce functions that are nearly power 
functions, for both loudness difference and loudness. In 
this respect the results are encouraging. . However, 
further evaluations of the specific models of 
psychophysical judgment are hampered by the observed 
deviations from a simple power function. Experiment 2 was 
conducted in order to improve on these results by a change 
in the stimulus set. In this experiment, the lowest 
stimulus was 9 dB above the lowest stimulus in Experiment 
1. This change was predicated on the assumption that the 
problems in Experiment 1 were due primarily to the 
absolute level of the stimuli. By eliminating some of the 
low level sounds, it was hoped that better fitting power 
functions could be obtained. 
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Experiment 2 

As in Experiment 1, the difference matches from the 
first of four sessions were discarded and the remaining 
data averaged as before. (Results for subject MP are 
based on the last three of five sessions. Following the 
second session, subject MP reported having been influenced 
by factors other than loudness, such as the annoyance, of 
the tones in the pair. Therefore, data from this 
subject's second session were also discarded, the subject 
was asked to judge only loudness differences, and a fifth 
session was added as a replacement.) The average 
difference matches for the seven subjects are contained in 
Appendix 2« 

Kendall's coefficient of concordance was again 
computed as a measure of across-session reliability (see 
Table 2). With the exception of CJ, all values are in 
excess of 0.91, indicating good agreement across sessions. 
Subject CJ shows somewhat greater variability from session 
to session. 

The scale values produced by TORSCA-9 for each 
stimulus are contained in Appendix 3. Table 2 shows the 
final stress values. According to Young's nomogram, 
between 97-99% of the variance in the true distances is 
accounted for by the recovered distances. As a precaution 
against local minimum problems, the difference matches 
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TABLE 2 

Values of the coefficients of concordance (W) 
and Kruskal's stress for Experiments 2 and 3. 

Experiment 2 Experiment 3 


ibject 

W 

Stress 

W 

Stress 



(%) 


(%) 

CJ 

.87 

8.5 

.85 

7.9 

GA 

.98 

2.7 

.97 

2.1 




(.97) 

(2.8) 

JF 

.96 

4.3 

.92 

4.4 

JP 

.94 

6.4 

.95 

4.9 

MP 

.96 

6.1 

.92 

5.3 

MR 

.94 

7.1 

.95 

5.3 

NH 

.96 

3.7 

.94 

3.2 



were reanalyzed using UNICON (Roskam, 1977) . This scaling 
routine is entirely nonmetric and quite different in its 
algorithmic structure from TORSCA-9. Despite these 
differences, the squared correlations for each subject 
between the two sets of scale values were equal to or 
exceeded 0.9994, indicating excellent agreement between 
the two scaling routines. Two such different scaling 
algorithms are unlikely to arrive at the sa..e suboptimal 
solution, indicating that the obtained solutions represent 
global rather than local minima. 

In Figures 9-*15, the loudness projections are plotted 
oh a logarithmic axis against sound pressure level in 
decibels. ' The procedure for optimizing the power function 
fits was the same as the one used in the analysis of 
Experiment 1. Only two subjects (MP and NH) exhibit any 
pronounced flattening at the bottom end of the function. 
Table 3 shows the exponents (slopes) alpha of the povut 
functions (estimated by the method of least squares) ; the 
corresponding squared correlations are contained in Table 
4. Four of the seven exponents fall betweer 0.56 and 
0.64. Subject JP's exponent of 0.32 is considerably 
smaller, and subjects GA and NH have exponents of 
Intermediate size. Visual inspection and the squared 
correlations show a good fit to the power function mode... 

Figures 16-22 contain the plots of produced decibels 
versus log estimated loudness dlfferenc Visual Inspec- 
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TABLE 4 


Squared 

correlatlonr 

for the power 

function 

fits to 


Experinents 2 and 

3 


Subject 


'5s 

"V 


CJ 

.975 

.950 

.991 

.940 

GA 

.998 

.973 

.997 

.951 




(.982) 

(.892) 

JP 

99d 

.927 

.980 

.937 

JP 


.913 

.980 

,937 

MP 

.974 

.939 

.996 

.938 

MR 

.972 

.910 

.994 

.925 

NH 

.980 

.962 

.991 

.941 
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tion of these graphs also shows an Increase in linearity 
over the results from Experiment 1 (compare, for example, 
subject CJ's data in Figures 7 and 16). The least squares 
estimates of the exponents (slopes) beta and the 
correpsonding r2's are listed in Tables 3 and 4. four of 
the seven exponents full in the range 0.55 and 0.66. The 
remaining exponents are 0.42, 0.51 and 0.52. Although 
these plots and the plots of the loudness projections show 
instances in which the data depart somewhat from a simple 
power function model, the departures do not appear to be 
severe nor do they recur with every subject. . 

Recall that the objective of this experiment was to 
determine the relation between two exponents: the exponent 
underlying loudness difference (alpha) and the exponent 
underlying loudness (beta) . The prediction was that beta 
would exceed alpha. In particular, based on past results 
it was predicted that beta would be approximately twice 
the value of alpha. Tabic 3, however, shows that alpha 
and beta are very close to each other. Indeed, the 
average. difference between alpha and bet 2 is not 
significant (paired ^(6)a-1.00, £>0.1, one-tailed). This 
suggests that subjects judged loudness and loudness 
difference on the same scale, a quite unexpected result. 
Furthermore, alpha and beta fall in the vicinity of 0.60, 
the exponent of the sone scale of loudness, which is based 
on the judgments of single tone loudness ratios. 
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Past studies have found judgments of loudness and 
loudness difference to be distinct, but have primarily 
involved numeric estimates of loudness differences rather 
than difference matching. The interpretation of the 
foregoing results will heavily depend on the results of 
the remaining experiments, in which the same subjects are 
asked to magnitude estimate loudness differences and the 
loudness of single tones. 

Experiment 3 


In this experiment, subjects estimated numerically 
the ratio of loudness differences relative to a standard 
loudness difference. Data from the first session were 
ignored and the remaining data averaged by taking the 
geometric mean of the estimates for pairs (i,j) and (j,i) 
across the last three sessions. In the case of subject 
CJ, the nonmetric analysis of these averages resulted in a 
fairly large stress of 11.3%. For this reason, three 
additional sessions were run with thi*. subject and 
included in the analysis, which resulted in a reduction in 
stress (see below). The estimates of the power function 
parameters, however, were only minimally affected by the 
inclusion of these additional data. 

A further comment is required concerning the data 
from subject GA. During the debriefing followin;; che 
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fourth session^ subject GA reported having judged loudness 
ratios instead of differences throughout this experiment 
(but not the .previous one). This, according to GA, seemed 
"natural* when the task was numeric estimation. Since GA 
indicated that he could have judged differences if he had 
chosen to, he was asked to repeat the experiment, this 
time judging loudness differences. Both sets of results 
are reported below, with the ratio estimation data 
enclosed in parentheses. 

Each subject's 36 geometric means are contained in 
Appendix 2; the TORSCA valuas are listed, in Appendix 3. 
Table 2 shows the values of Kendall's coefficient of 
concordance and stress. Young's index of metric 
determinacy, indicating the amount of explained variance, 
was estimated at 97-99%. The data were reanalyzed using 
UNICON, in order to guard against suboptimal solutions. 
The squared correlations between the two sets of scale 
values were all equal to or in excess of 0.9999. 

Plots of loudness projections (on a logarithmic axis) 
versus sound pressure level in decibels are shown in 
Figures 23-29. In this experiment, the exponents (slopes) 
of these functions estimate gamma (see Equation 2). The 
least-squares estimates of gamma are contained in Table j; 
the associated r2's are listed in Table 4. The squared 
correlations and a visual inspection of the Figures show 
an excellent fit to the power function model. The 
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exponents range from 0.27 (excluding GA's exponent of 
0.06) to 0.46. 

Notice that the slope for GA's ratio estimation data 
(enclosed in parentheses) is considerably smaller than 
that based on his, (and others') difference estimation 
results. This is consistent with the hypothesis that GA 
was judging loudness ratios instead of differences, for 
the following reason. If the first results represent 
judgments of loudness ratios then the application 

of a difference model to these data requires that the 
stimulus points be interpreted as projected onto a line 
sea id in log loudness (a log interval scale ) , since 

log (Lj/Lj)«log Li-log Lj. Hence, T*alog L+ b, and on the 
assumption that loudness is a power function of intensity 

(x.e. log* loudness is linear with log I), T^a'log I-t-b'. 
This says that the TORSCA values are a logarithmic 
function, not a power function of intensity. When power 
functions are fit to functions that closely approximate 
logarithmic functions, the exponent will aproach zero (see 
Stevens, 1975). Thus, subject GA does indeed appear to 
have judged loudness ratios in the first case, but not the 
second. 

In order to estimate (1/delta) , power functions were 
fit (by the method of least squares) to the relation 
b. tween the log of the numeric response and the log 
c loudness difference (see Figures 30-36). 


The 



values o£ delta and the squared correlations are listed in 
Tables 3 and 4, respectively. Delta ranges from 0.48 to 
1.35, but five out of seven estimates are clustered 
between 0.73 to 0.85. The results indicate that the 
numerical responses in this -Experiment were nonlinearly 
related to the estimated loudness differences, since 
linearity would have produced a delta of 1.0. 

Comparison between the exponents alpha and gamma 

Table 3 shows that alpha exceeds gamma for six out of 
seven subjects. The average difference is significantly 
greater than zero (paired ^(6)»2.78, £<0.05, two-tailed). 
A subject-by-subject test for the difference between 
slopes is possible, based on the assumption that the 
relationship between the logarithm of the loudness 
projections and decibels follows a linear model (Kleinbaum 
& Kupfer, 1978). In this case, the test for the 
difference between two slopes is similar to a t-test for 
the difference between two means. Table 5 contains the 
results of these cs, together with the 95% confidence 
intervals for each individual slope. The t-tests indicate 
that for all but one subject (GA) the values of alpha and 
gamma differ significantly. 

Among the seven subjects, four (CJ, JF, MP and MR) 
show alpha considerably greater than gamma (by 
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TABLE 5 


95% confidence limits and test for the difference between 

OC and If 


Subject 0( IC t(14) 

95% CL 95% CL 


CJ 

.51 

— 

.68 

.26 - .30 

8.4 

itita 

GA 

.46 

- 

.49 

.44 - .48 

1.2 

ns 

JP 

.52 

- 

.59 

.32 - .42 

7.6 

*** 

JP 

.31 

- 

.34 

.35 - .40 

-4.9 


MP 

.51 

- 

.68 

.26 - .29 

8.8 


MR 

.54 

- 

.73 

.39 - .46 

4.9 


MB 

.46 

- 

.59 

.40 - .47 

2.7 

* 


* p < .05, two-tailed 
**♦ p < .001, two-tailed 



approxinataly a factor of 1.5-2), JP and NH show smaller 
differences in exponent, with* JP's gamma greater than 
alpha, and GA shows an insignificant difference. Notice, 
however, that the one reversal in the sign of the 
difference occurs with r.ubject JP, whose exponent for 
Experiment 2 is noticeably below the other exponents 
obtained in that experiment. Clearly this subject is 
behaving quite differently from others in the difference 
matching task, and consequently might be expected to show 
a different relation between alpha and gamma. 

One consideration in assessing the significance of 
the differences between alpha and gamma is the sensitivity 
of these exponents to the value of the additive constant 
used to optimize the power function fit. Recall that the 
loudness projections were adjusted addltively so as to 
maximize the squared correlation between the logarithms of 
the loudness projections and decibels. Examination of 
this optimization process reveals that small deviations 
from the optimum t can produce sizeable changes in the 
exponent as subsequently estimated by the method of least 
squares. In Table 6 are listed the upper and lower bounds 
for each exponent, as r2 is allowed to deviate from its 
optimum by 0.001 and 0.005. It can be seen that for 
subjects CJ, JF, MP <.nd MR the boundaries around the 
exponents show no or only marginal overlap between 
Experiments 2 and 3, The other subjects have mor 
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TABLE 6 


Bounds on 

0( and ^ for 

changes of 

.001 and 

.005 


in the 

maximum r2 of 

the power 

function 

fit 

Subject 


CK 


r 



lower 

upper 

lower 

upper 

CJ 

.001 

.55 

.63 

.24 

.32 


.005 

.50 

.68 

.19 

.36 

GA 

.001 

.43 

.52 

.42 

.50 


.005 

.37 

.58 

.36 

.55 

JF 

.001 

.50 

.61 

.36 

.41 


.005 

.44 

.67 

.29 

.45 

JP 

.001 

.27 

.37 

.32 

.43 


.005 

.21 

.43 

.24 

.50 

MP 

.001 

in 

e 

.66 

.22 

.32 


.005 

.47 

.73 

.16 

.38 

MR 

.001 

.60 

.67 

.39 

.47 


.005 

.56 

.71 

.33 

.51 

MU 

.001 

.45 

.59 

.38 

.49 


.005 

.37 

.69 

.31 

.56 



extensive areas of overlap. These results, of course, do 
not carry the force of a statistical test. However, 
insofar as a change in the total explained variance of 
0.001 or 0.005 is interpreted as defining a confidence 
region, the above analysis suggests that for four subjects 
the differences between alpha and gamma are real. 

A similar pattern of results emerges if one considers 
the correlations between the TORSCA values of Experiments 
2 and 3. For subjects GA and JP the r2's are in excess of 
0.997, whereas for subjects CJ, MP and MH the r^'s do not 
exceed 0.965. Subjects JF and NH fall in the middle with 
r-squares of 0.987 and 0.989, respectively. All these 
squared correlations are high, since in each experiment 
the scale values aro> highly correlated with stimulus 
level. However, within their narrow range, these 
correlations mirror tne differences in exponents, which 
n>‘ovides some reassurance that the application of power 
functions has not seriously distorted the relations 
between the two sets of scale values. 

Since the analysis of the raw data (the decibels 
produced and the numbers uttered) is nonmetric, the 
differences between the data from Experiments 2 and 3 
should be evident in a comparison between the ranks 
assigned to the stimulus pairs under the two procedures. 
An example of such a comparison in presented in Figure 37 
(other comparisons lead to similar conclusions) . The 
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picture is consistent with what would be expected on the 
basis of differences in the exponents of the underlying 
power functions. The identity of the extreme pairs (that 
is, the pairs ranked lowest and highest) does not change, 
but there is movement in the middle. Examination of the 
raw averaoes (see Appendix 2) reveals that a pair of 
i..cense ccnes tends to receive a higher rank (representing 
a larger difference) based on difference matches than 
numeric estimates. For example, in Experiment 2, subject 
CJ judged the loudness difference between 85 and 90 dB to 
equal the loudness of a tone of 53.9 dB, thereby ranking 
it fifteenth among 36 loudness differences (the smallest 
loudness difference was ranked one). However, in 
Experiment 3, subject CJ gave the same pair a magnitude 
estimate of 11.69, thereby ranking it only seventh. The 
larger ranks for the pairs of intense tones in Experiment 
2 result in a larger spacing among the intense stimuli and 
a steeper exponent in the recovered power function. 

Experiment 4 

In this experiment, subjects provided magnitude 
estimations of the loudness of single tones, relative to a 
standard tone of 75 dB SPL, whose loudness was designated 
100. The responses were averaged within subjects by 
taking the geometric mean of the 10 estimates of each 




stiaulus. Appendix 2 contains the geometric means for 
each subject. 

In Figures 38-44, the geometric means are plotted on 
a logarithmic axis against sound pressure level in 
decibels. A linear relationship in these coordinates 
indicates a power function relation between loudness and 
sound pressure. The exponents (slopes) epsilon in N«kl^ 
were estimated by the method of least squares and are 
listed in Table 7, along with the squared correlations. 
Six out of seven values of epsilon fall in the range 0.52 
and 0.68. The exponent for subject CJ (0.26) is quite low 
by comparison. Figure 38 shows that CJ's function is 
indeed unusual and has a slope that would be even lower in 
the absence of the endpoints. Aside from this one 
subject, the other subjects provide exponents near the 
usual value of 0.60 for the m&gnitude estimation of single 


tone loudness 
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TABLE 7 


Power function exponents and squared correlations for 

Experiment 4 

Experiment 4: N « 

Subject £ ’ r2 


CJ 

.26 

.882 

GA 

.60 

.982 

JP 

CM 

in 

• 

.970 

JP 

.68 

.988 

MP 

.62 

.974 

MR 

.58 

.980 

MH 

.52 

.978 
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